ABSTRACT

Systems described by parabolic partial differential equations are formu- 
lated as ordinary differential equations in a Sobolev space of a given 
order. Quadratic cost criteria are then formulated in terms of inner 
products on this Sobolev space. Existence of an optimal control is 
proved both in the case where the system operator is coercive and in 
the case where the system operator is the infinitesimal generator of a 
semigroup of operators. The optimal control is given by a linear 
state feedback law. The feedback operator is shown to be the bounded, 
positive, self-adjoint solution of a nonlinear operator equation of the 
Riccati type. This operator can also be represented by an integral
operator whose kernel satisfies a Riccati-like integro-differential
equation.

These results are specialized, in a straightforward manner, to the
case of pointwise control. The optimal pointwise control is given by a 
simplified linear control law which depends on the control point lo- 
cation. The general results are also specialized to obtain the modal 
approximation to the pointwise control problem and to demonstrate 
the optimality of output feedback for a particular class of output trans- 
formations. The pointwise feedback control laws, in these cases, are 
characterized, structurally, by a measurement operation which is
independent of control point location and a gain operation which is
directly dependent on control point location. Several examples relating
to the scalar heat equation are solved. 
CHAPTER I


INTRODUCTION 

A great deal of research in recent years has been devoted to the 
optimal control of distributed parameter systems. With few exceptions 
this work has focused on the use of distributed, rather than lumped, 
controls. In many cases of practical interest, however, it is actually 
desired to control distributed parameter systems by means of
finite-dimensional controls. Examples of such cases would be the control of
the flexure of a launch booster using only rocket thrust and control of 
wing and fuselage flexure in aircraft by means of rudder, flap, aileron, 
and spoiler manipulation. In both of these examples the controls are 
a finite number of pointwise controls, that is, controls applied at 
isolated points within the spatial domain of definition of the distributed
parameter system. Other examples of systems in which pointwise 
control might be applied are heat diffusion systems, systems described 
by wave equations such as longitudinally vibrating beams and trans- 
mission lines, transversely deflecting flexible beams, and mechanically 
vibrating systems.

Traditional approaches to solving this type of problem include 
solving for an optimal distributed control and then approximating the 
distributed control by a finite number of lumped controls, or the modal 
approach, as used by Johnson,^1 for example, in which the system is
assumed to be adequately described by a finite number of modes and the
resulting finite-dimensional optimization problem is solved for the
optimal modal control. The former approach becomes inadequate if 
we wish to determine feedback control laws rather than open-loop
controls, and the latter fails when the system cannot be described by a
countable number of modes, when the number of modes necessary for 
adequate description of the system is prohibitively large, when it is 
computationally difficult to determine the eigenvalues and eigenfunctions, 
or when it is difficult to judge which modes are indeed the dominant
ones.

The purpose of this research is to formulate the pointwise control 
problem as a distributed parameter control problem and to present a 
unified approach to solving this problem within the framework of 
existing distributed parameter control theory. The distributed param- 
eter systems we shall consider are described by parabolic and hyper- 
bolic partial differential equations. Examples of parabolic partial 
differential equations are the scalar heat diffusion equation and the heat 
diffusion equation in the plane. Wave equations and the equation for 
transverse deflection of a flexible beam fall within the category of 
hyperbolic partial differential equations. The cost functional is
quadratic in the deviation of the state distribution from a desired 
distribution and in the control energy. The choice of such a cost 
functional is motivated by the desire to derive, in the distributed
parameter case, results of comparable elegance to those of
finite-dimensional control problems with quadratic cost, namely linear
feedback control laws and simply expressed quadratic optimal cost
functions.

There are many approaches to the solution of general distributed
parameter control problems. One of the earliest systematic approaches
was that of Butkovskii.^2 He presents a maximum principle for
distributed parameter optimal control problems analogous to Pontryagin's
maximum principle for lumped parameter control problems. The
distributed parameter systems which Butkovskii considers are those
described by systems of integral equations and the necessary conditions
for optimality which he obtains by variational techniques are also in 
the form of integral equations. Since we shall consider systems de- 
scribed by parabolic and hyperbolic partial differential equations, our 
control problem is not in a form in which the Butkovskii maximum 
principle is immediately applicable. There are methods, namely
Green's function techniques, whereby the partial differential equation
description of a distributed parameter system may be transformed to 
an equivalent integral equation description, but these techniques tend 
to be difficult to apply to the general classes of spatial differential
operators we shall consider.

Wang^3 derives a maximum principle for distributed parameter
systems described by partial differential equations by using a dynamic 
programming procedure. The necessary conditions he obtains are in 
the form of partial differential equations. An unfortunate aspect of 
Wang's maximum principle is that, although it is systematic in princi- 
ple, there is no systematic way of treating boundary conditions. More- 
over, in a strict mathematical sense, it is impossible to prove existence 
and uniqueness of optimal solutions in the function space in which Wang 
formulates his control problems. 

A step in the direction of formulating distributed parameter
control problems in a form more amenable to the application of well-known
system theoretic concepts is taken by Balakrishnan,^4 who considers the
state distribution in the distributed system to be a point in some Banach
space and then regards the partial differential equation describing the
time evolution of the state distribution to be an ordinary differential
equation in the strong topology of the Banach space. He uses the
assumption that the system spatial differential operator is the
infinitesimal generator of a semigroup of operators, the infinite dimensional
analog of the transition matrix in finite dimensional systems, and
proceeds to solve final value problems and time-optimal problems by
means of well-known functional analytic methods. Fattorini^5 works
along these same lines in considering the controllability of distributed
parameter systems containing both distributed control and boundary
control. Unfortunately for the problem we wish to consider, or, more
precisely, for quadratic cost functionals, the Banach spaces used by
Balakrishnan and Fattorini are much too general.

There have been several applications of the above techniques.
Egorov^{6,7} attacks a problem with both interior and boundary control.
He considers the system partial differential equations and boundary
condition equations as dynamics and introduces appropriate adjoint
variables to obtain a maximum principle separated into an interior
inequality and a boundary inequality.

Sakawa^8 considers linear one-dimensional distributed parameter
systems, with boundary control, as represented by integral equations,
and, using variational techniques, derives integral equation necessary
conditions which are simpler in form, but less general in application,
than Butkovskii's maximum principle.

Yeh and Tou^9 treat systems in which the controlled object moves
continuously through the plant with a constant velocity. With the control
continuously through the plant with a constant velocity. With the control 
assumed to be constrained in magnitude, the authors minimize a 
quadratic criterion via Butkovskii's maximum principle. The optimal 
control is shown to be the solution of a Fredholm integral equation of
the second kind. 

Kim and Erzberger^10 also consider the minimization of a quadratic
cost functional, using a dynamic programming approach to obtain a set
of functional equations analogous to the matrix Riccati equation for
lumped systems. They solve these equations by a method based on the
eigenfunction representation of the Green's function.

Axelband^11 solves the problem of minimizing the norm of the
difference of a distributed parameter system output from a desired output
by the use of a functional analytic formulation similar to Balakrishnan's.
He obtains an optimal solution by a convex programming algorithm.

Sirazetdinov^{12,13} considers a quadratic cost functional and, using
stability theory and dynamic programming arguments, proves the
optimality of a distributed control law which is linear in the state of
the system and derives integro-differential equations for the coefficients
of the optimal cost function. He applies this to the problem of
controlling aerodynamic and elastic deformation of an airframe.

Yavin and Sivan^14 treat the optimal control of longitudinal vibrations
in a flexible rod held fixed at one end. From a partial differential
equation formulation they obtain the proper Green's function for trans- 
formation to an integral equation. Using a quadratic criterion and a 
control applied at the force end, they obtain necessary conditions in the 
form of a Fredholm equation of the second kind. An approximate open- 
loop control is obtained by approximating the kernel by a sequence of 
degenerate kernels. 

In a recent book, Lions^15 formulates quadratic distributed parameter
control problems in Hilbert spaces in which the terms of the quadratic
cost functional may be written as inner products. He shows, for
systems described by spatial differential operators satisfying a certain
definiteness condition, that solutions to the system equation exist and
are continuous with respect to the control in the topology of the Sobolev
space of order equivalent to the order of the system spatial differential 
operator. Using these Sobolev spaces, he is able to prove the existence 
and uniqueness of an optimal control and to determine the necessary 
conditions for the optimality of this control. Moreover, he shows that 
the optimal control is specified by a linear feedback control law and that 
the feedback operator satisfies a differential equation similar to the 
matrix Riccati equation obtained for finite-dimensional systems.

Lions' results are the foundations upon which this research is 
built. We shall extend the class of system spatial differential operators 
considered by Lions to include those which are infinitesimal generators 
of semigroups of operators and will show that the results obtained by 
Lions for his more restrictive class also hold in the more general case. 

A fact of key importance which we shall use is that differential operators 
defined on a Sobolev space are closed operators in the topology of that 
Sobolev space. This is one of the requirements for an operator to be 
the infinitesimal generator of a semigroup of operators. Another useful 
feature of Sobolev spaces is that boundary conditions become easy to 
handle when the state space for the system is a Sobolev space. We 
shall also show that distributed systems driven by finite dimensional 
controls (the pointwise control problem) fall within the framework of 
this formulation and the results obtained for a general class of controls
are specialized to the case of pointwise control in a straightforward
manner. It should be noted that Russell^16 attacks the problem of
constrained pointwise control with a minimum system energy cost functional.
He does not develop a Hilbert space formulation of the problem and he
circumvents the unboundedness of the system spatial differential oper- 
ator by assuming that his initial states have bounded spatial derivatives. 
By treating our pointwise control problem as a special case of a general 
quadratic optimization problem in a Sobolev space, we need not con- 
sider any such confining assumptions on the initial conditions. 

The organization of the thesis is as follows: Chapter II provides 
the mathematical background necessary for the formulation of parabolic 
and hyperbolic optimal control problems. The material is presented in 
such a form as to point out continually the relationships between 
infinite dimensional and finite dimensional system theoretic concepts. 
Sobolev spaces of finite order are defined by means of distribution 
theory. Elliptic differential operators of the coercive and, more
generally, strongly elliptic type are defined on these Sobolev spaces. It
is then shown that parabolic and hyperbolic partial differential equations
may be written as ordinary differential equations in the Sobolev space.
The remainder of the chapter is devoted to semigroups of operators:
their definition, the concept of infinitesimal generator, and the
presentation of a formula analogous to the variation of constants formula
in finite dimensional systems. 

In Chapter III we present the precise mathematical formulation of
the parabolic and hyperbolic optimal control problems. The parabolic
control problem is then specialized to the case of pointwise control.

Chapter IV is concerned with the solution of the parabolic optimal 
control problem in both the case where the system operator is assumed 
to be coercive and in the case where the system operator is assumed to 
be the infinitesimal generator of a semigroup of operators. The path to 
a solution first involves proving that a unique solution indeed exists. 
In the coercive system operator case it will be shown that because of
continuity of the optimal state in the optimal control, the optimal con- 
trol is given by a linear state feedback law in which the feedback 
operator is the solution of a Riccati operator differential equation. 
Under the assumption that the system operator is the infinitesimal 
generator of a semigroup of operators this continuity relation is not 
easily demonstrable, but it is shown that if a solution of the Riccati 
operator equation exists, then the optimal control is given by a linear 
feedback control law. It will then be shown that such a solution does 
exist. The remainder of the chapter contains a discussion of the be- 
havior of optimal solutions when the terminal time approaches in- 
finity and an alternative formulation of bounded operators on a Sobolev 
space as integral operators and the subsequent modification of the 
Riccati operator equation. 

With optimal solutions to the parabolic control problem having been
determined for general control spaces, we specialize the results to the
case of pointwise control in Chapter V and show that the optimal
feedback operator in the pointwise control case is of a simpler form from a
computational point of view. The second part of this chapter is con- 
cerned with the infinite terminal time pointwise control problem. It 
will be shown that by a judicious choice of the quadratic cost functional 
the modal analytic formulation of the pointwise control problem is ob- 
tained. This approach will enable us to make conclusions about the 
optimality of modal analytic solutions which we are unable to make by 
the straightforward techniques of modal analysis alone. We then con- 
sider the case where the entire state is not available to be fed back, 
but only the outputs of a finite number of measuring devices. It will
be shown that if the measuring devices are of a certain class, then the
optimal control law will consist of feeding back only the outputs of 
these devices. 

The concluding Chapter VI contains a summary of the results ob- 
tained and recommendations for further research. 

It should be stressed that throughout the thesis a general class of 
distributed parameter optimal control problems will be solved, and the 
results will be specialized so as to obtain results in the pointwise 
optimal control problem and to obtain insight into the modal analytic 
and measurement problems.



CHAPTER II 


MATHEMATICAL BACKGROUND 
2.1 INTRODUCTION 

The purpose of this chapter is to lay the mathematical foundation 
for the discussions and derivations in succeeding chapters. The vari- 
ous results stated in this chapter do not exhaustively cover the field of 
differential operators and partial differential equations, but serve to 
form a relatively complete set of tools to be applied to the problems of 
interest. The guiding philosophy for both choice of results to be dis- 
cussed in this chapter and direction of theory in the sequel is the at- 
tempt to provide results for distributed parameter systems which are 
roughly parallel to known results in lumped parameter theory. In 
order to achieve this parallelism, related concepts in distributed 
parameter theory must be provided for such lumped parameter system 
concepts as state and state space, matrix operators, equations of state, 
transition matrices, and variation of constants formulae.

Section 2.2 is concerned with the concept of state in distributed 
parameter systems and the discussion of particular spaces of (general- 
ized) functions which serve as state spaces for systems described by 
partial differential equations. 

The reason for the choice of the spaces in Section 2.2 is made 
more clear when spatial differential operators are discussed in 
Section 2.3 and it is seen that elements of these spaces have sufficient 
smoothness to qualify as elements of the domain of differential
operators. The properties of coercivity and strong ellipticity of differential
operators are treated in this section. The distinction between these
two concepts will not be apparent until necessary conditions for optim- 
ality are discussed in Chapter IV. 

Parabolic and hyperbolic partial differential equations and their 
boundary conditions are introduced in Sections 2.4 and 2.5. Emphasis 
is placed throughout these two sections on the fact that these equations 
serve as equations of state exactly as ordinary differential equations 
describe the evolution of finite-dimensional state variables.

In Section 2.6 the concept of a semigroup of operators, the analog
of the transition matrix in the finite dimensional case, is defined and
explored. In addition to the properties of these semigroups the manner
in which an operator may generate a semigroup of operators is
discussed. This is further elaborated on in Section 2.7 where strong
ellipticity is shown to be a sufficient condition for a differential oper- 
ator to be the infinitesimal generator of a semigroup of operators. 

The final section of the chapter contains the relation of the
semigroup of operators generated by the system operator of a forced
(controlled) system to solutions of this system. This expression for
solutions of the forced system corresponds directly to the variation of
constants formula for the state of a finite dimensional forced system. 

2.2 DISTRIBUTION THEORETIC CONCEPTS AND SOBOLEV SPACES

The state of a finite dimensional system can be identified as a 
point in a finite dimensional Euclidean vector space. In distributed 
parameter systems the state is a function, at each instant of time, de- 
fined on the given spatial region, or, alternatively, the state is a point 
in an infinite dimensional (function) space. For the purpose of pre- 
paration for our subsequent study of quadratic performance criteria, 
attention will be focused on the Hilbert space of square integrable
functions on the spatial region of definition. As will be shown, this space
is not quite suitable for distributed parameter applications, but certain 
subspaces, namely the Sobolev spaces of finite order, are. As a pre- 
liminary to the definition of Sobolev spaces, a brief discussion of 
distribution theory is required. 

Let us denote by $D$ and $\partial D$ the spatial region of definition and
its boundary. The variable $z$ is used to denote a point in $D$. Further,
let $C_0^\infty(D)$ be the space of infinitely differentiable functions of compact
support on $D$. The space of bounded linear functionals on $C_0^\infty(D)$ (i.e.,
the dual of $C_0^\infty(D)$) is called the space of distributions on $D$ and is
denoted by $\mathcal{D}'(D)$. An element $F$ of $\mathcal{D}'(D)$ has the form

$$F(\phi) = \int_D f(z)\,\phi(z)\,dz \qquad \forall \phi \in C_0^\infty(D)$$

where $f(\cdot)$ is some Lebesgue integrable function on $D$. The most
familiar example of a distribution is the Dirac $\delta$-function or impulse,
$\delta(z-z')$, which is the linear functional

$$\Delta(\phi) = \int_D \delta(z-z')\,\phi(z)\,dz = \phi(z')$$

There are several properties of the space of distributions which
we shall exploit. First, the space of square integrable functions on $D$,
$L^2(D)$, is a subset of the space of distributions. This is easily seen by
noting the fact that $C_0^\infty(D) \subset L^2(D)$ (any infinitely differentiable function
with compact support in $D$ must be square integrable on $D$) and,
therefore, the dual space of $L^2(D)$ must be contained in the dual space
of $C_0^\infty(D)$, namely $\mathcal{D}'(D)$. Since $L^2(D)$ is its own dual the following
inclusion relation holds

$$C_0^\infty(D) \subset L^2(D) \subset \mathcal{D}'(D)$$


The second property of distributions which it is useful to exploit is the
unique specification of the derivative of a distribution. If $D$ is a
region in n-dimensional Euclidean space and $z$ is the n-tuple
$(z_1, z_2, \ldots, z_n)$, $\partial F/\partial z_i$ for some $F \in \mathcal{D}'(D)$ is uniquely specified by

$$\frac{\partial F}{\partial z_i}(\phi) = \int_D \frac{\partial f}{\partial z_i}(z)\,\phi(z)\,dz = -\int_D f(z)\,\frac{\partial \phi}{\partial z_i}(z)\,dz = -F\!\left(\frac{\partial \phi}{\partial z_i}\right)$$

What, in effect, has been achieved is the ability to specify a meaningful
expression for the operation of differentiation of any distribution, or,
more to the point, differentiation is defined for all elements of $L^2(D)$.
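The defining identity for the derivative of a distribution can be checked numerically. The following is a minimal sketch, assuming the one-dimensional region $D = (-1, 1)$, the non-differentiable function $f(z) = |z|$, and a particular shifted bump function as the test function $\phi$; these choices, and all function names, are illustrative and do not come from the text.

```python
import math

def bump(s):
    """Smooth function that vanishes outside (-1, 1)."""
    return math.exp(-1.0 / (1.0 - s * s)) if abs(s) < 1.0 else 0.0

def bump_prime(s):
    """Classical derivative of bump."""
    if abs(s) >= 1.0:
        return 0.0
    return bump(s) * (-2.0 * s) / (1.0 - s * s) ** 2

# Test function phi with support [-0.2, 0.8], strictly inside D = (-1, 1).
CENTER, HALF_WIDTH = 0.3, 0.5

def phi(z):
    return bump((z - CENTER) / HALF_WIDTH)

def phi_prime(z):
    return bump_prime((z - CENTER) / HALF_WIDTH) / HALF_WIDTH

def integrate(g, a=-1.0, b=1.0, n=20000):
    """Composite midpoint rule; z = 0 falls on a cell edge, never a midpoint."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def f(z):
    """Square integrable on D but not differentiable at z = 0."""
    return abs(z)

def weak_df(z):
    """Candidate generalized derivative of f (the sign function)."""
    return 1.0 if z > 0 else -1.0

# (dF/dz)(phi) computed from the candidate derivative ...
lhs = integrate(lambda z: weak_df(z) * phi(z))
# ... and from the definition -F(dphi/dz), which needs no derivative of f.
rhs = -integrate(lambda z: f(z) * phi_prime(z))
print(lhs, rhs)
```

The two printed numbers should agree to within quadrature error, which illustrates how the derivative is shifted onto the smooth test function.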

This generalized approach to differentiation can be extended to
more complicated differential operators. Introducing the following
notation, we let

$$q = (q_1, q_2, \ldots, q_n), \qquad |q| = \sum_{i=1}^{n} q_i \tag{2.2.1}$$

where $q_i$ is a nonnegative integer for $i = 1, 2, \ldots, n$, and defining the
differential operator

$$D^q = D_1^{q_1} D_2^{q_2} \cdots D_n^{q_n}, \quad \text{with } D_i = \frac{\partial}{\partial z_i} \tag{2.2.2}$$

then for each $F \in \mathcal{D}'(D)$

$$D^q F(\phi) = (-1)^{|q|} F(D^q \phi) \qquad \forall \phi \in C_0^\infty(D)$$

Let us make the following definition

Let us make the following definition 


Definition 2.1: The subset of $\mathcal{D}'(D)$, denoted by $H^m(D)$, with
the property

$$H^m(D) = \{F \in \mathcal{D}'(D) : F \in L^2(D),\ D^q F \in L^2(D)\ \forall q,\ |q| \le m\}$$

is called the Sobolev space of order m, with m an integer.



Moreover, defining the following inner product for $F, G \in H^m(D)$

$$\langle F, G\rangle_{H^m(D)} = \sum_{|q| \le m} \langle D^q F, D^q G\rangle_{L^2(D)}$$

$H^m(D)$ can be shown to be complete in the topology induced by this
inner product (see Ref. 17, Chapter 4), and, therefore, $H^m(D)$ is a
Hilbert space.

The usefulness of the Sobolev space $H^m(D)$ can be understood
when it is recalled exactly what are the useful properties of
finite-dimensional state spaces. First, any finite dimensional space is
complete and any operator (matrix) on this space is everywhere defined.
The differential operator $D^q$, as described above, is everywhere
defined on $C_0^\infty(D)$, the space of infinitely differentiable functions with
compact support in $D$. Unfortunately, there is no norm topology for
which this space has the completeness property of finite dimensional
vector spaces. The second useful property of finite dimensional
spaces is the fact that all linear operators on these spaces are closed.
If $L^2(D)$ is taken to be the space on which $D^q$ is defined (in this case
only densely defined), $D^q$ is not a closed operator. If $\bar{D}^q$ is the
closure of $D^q$ on $L^2(D)$, then the domain of $\bar{D}^q$ would contain
non-differentiable functions. By the artifice of introducing distributions
we are able to define the derivative even for non-differentiable
functions, and it is easily seen that the non-differentiable functions in the
domain of the closure of $D^q$ are those functions $F$ in $L^2(D)$ for
which $D^q F$ is in $L^2(D)$. More succinctly, the domain of the closure
of $D^q$ is $H^m(D)$ for some m. With the inner product defined above
for $H^m(D)$ the Sobolev space of order m has the very useful property
of completeness. Thus, it is seen that Sobolev spaces fill the bill as
candidates for distributed parameter state spaces.
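To make the Sobolev inner product concrete, the sketch below evaluates it numerically in the simplest case, $m = 1$ on the one-dimensional region $D = (0, 1)$, where it reduces to the $L^2$ inner product of the functions plus the $L^2$ inner product of their first derivatives. The region, the sine functions used as sample elements, and the helper names are assumptions made for illustration only.

```python
import math

def integrate(g, a=0.0, b=1.0, n=2000):
    """Composite midpoint rule on [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def h1_inner(f, df, g, dg):
    """<f, g> in H^1(0, 1): L^2 product of the functions plus that of their derivatives."""
    return integrate(lambda z: f(z) * g(z)) + integrate(lambda z: df(z) * dg(z))

def mode(k):
    """Return sin(k*pi*z) together with its classical derivative."""
    return (lambda z: math.sin(k * math.pi * z),
            lambda z: k * math.pi * math.cos(k * math.pi * z))

f1, df1 = mode(1)
f2, df2 = mode(2)

norm_sq = h1_inner(f1, df1, f1, df1)   # exact value: 1/2 + pi^2/2
cross = h1_inner(f1, df1, f2, df2)     # exact value: 0
print(norm_sq, cross)
```

The derivative term dominates the squared norm, which shows how the $H^1$ topology penalizes roughness that the $L^2$ topology ignores.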
2.3 DIFFERENTIAL OPERATORS

With the introduction of Sobolev spaces as the prototype of a 
state space for distributed parameter systems, it remains to be dis- 
cussed what exactly are the properties of spatial differential operators, 
which play the role in distributed parameter systems which matrices 
play in lumped parameter systems. Some of these properties were 
touched on in the preceding section as part of the justification of the 
usefulness of Sobolev spaces. It was shown, in essence, that a dif- 
ferential operator of order m is everywhere defined (with the aid of 
distribution theory) and closed on H m (D) . This is, however, all that 
linear differential operators have in common with linear operators in 
finite dimensional spaces. 

The first property which characterizes differential operators is 
the fact that they are not bounded. This, aside from the infinite di- 
mensionality, is the single most complicating factor in distributed 
parameter systems. It causes difficulty in proving existence of so- 
lutions to partial differential equations, and, in contradistinction to 
finite dimensional systems, necessitates that great pains must be taken
in characterizing these solutions, as will be seen in Sections 2.7 and 2.8 of
this chapter.

The particular type of differential operators which will be
considered, as indicated somewhat by the operator in the preceding
section, are those of linear form, composed of partial derivatives with
respect to each component of the spatial variable and of a specified
order m. Embellishing the notation of Section 2.2, let us introduce the
real functions $a_q(z)$, where $q$ is the n-tuple defined by (2.2.1), and
define the formal differential operator $A$, of order m
$$A = \sum_{|q| \le m} a_q(z)\,D^q \tag{2.3.1}$$

where $D^q$ is the differential operator described in Eq. 2.2.2 and the
notation $\sum_{|q| \le m}$ signifies the composite summation

$$\sum_{|q| \le m} = \sum_{|q|=0} + \sum_{|q|=1} + \cdots + \sum_{|q|=m}$$

with $D^0$ representing the zeroth order differential, or identity,
operator. The particular nature of the functions $a_q(z)$ will be
clarified in the discussions on coercivity and ellipticity.

Just as the formal differential operator $A$ has been defined, it
is a straightforward matter to define the formal adjoint of $A$, denoted
by $A^+$, as

$$A^+ = \sum_{|q| \le m} (-1)^{|q|} D^q a_q(z) \tag{2.3.2}$$

In general, the formal adjoint $A^+$ does not equal the adjoint operator
$A^*$, where $A^*$ satisfies

$$\langle x, Ay\rangle_{H^m(D)} = \langle A^* x, y\rangle_{H^m(D)}$$

Indeed, it can be shown, by means of Green's Formula, that

$$\langle x, Ay\rangle_{H^m(D)} = \langle A^+ x, y\rangle_{H^m(D)} + C$$

where the constant $C$ depends on conditions at the boundary $\partial D$. In
the case of Dirichlet boundary conditions, which will be discussed in
Section 2.5, $C = 0$ and the formal adjoint $A^+$ equals the adjoint $A^*$.

We shall now discuss what is meant by an elliptic differential
operator, and we shall subsequently define the properties of coercivity
and strong ellipticity of elliptic differential operators, which properties
will play an important role in the optimization results of Chapter IV.
If the functions $a_q(\cdot)$ are required to be essentially bounded functions, or,
equivalently, are elements of the space $L^\infty(D)$, then an m-th order
differential operator of the form specified in Eq. 2.3.1 is said to be elliptic
(see Ref. 13, p. 1704) if the inequality

$$\sum_{|q|=m} a_q(z)\,\xi^q \ne 0 \quad \text{for all } \xi \in R^n,\ z \in D$$

is satisfied. Note that this is a condition on the highest order term of
the differential operator, i.e., the terms containing partial derivatives
of order m. If we restrict our attention to elliptic differential
operators which contain only even order partial derivatives, we define the
concept of coercivity in the following manner:


Definition 2.2: If $A$ is an elliptic differential operator of the
form

$$A = \sum_{|q| \le 2p} a_q(z)\,D^q$$

where $a_q(z) = 0$ if $|q| \ne 2k$, for $k = 0, 1, \ldots, p$, then $A$ is said to be
coercive if the inequality

$$(-1)^k \sum_{|q|=2k} a_q(z)\,\xi^q \le -\alpha \sum_{|q|=2k} \xi^q \tag{2.3.3}$$

is satisfied for some $\alpha > 0$, for $k = 0, 1, \ldots, p$, and for all $\xi \in R^n$ and $z \in D$.


This concept of coercivity arises from the use of this term by
J. L. Lions (Ref. 15, p. 22) to describe the property of operators more
commonly referred to as "negative definiteness", namely the condition

$$\langle Ax, x\rangle_{H^m(D)} \le -\alpha\,\|x\|^2_{H^m(D)}$$

for some $\alpha > 0$ and for all $x \in H^m(D)$. It might be noted that just as
negative definiteness of a matrix implies that the eigenvalues of the
matrix lie on the negative real axis, the spectrum of a coercive
operator is a subset of the left half-plane.
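The matrix side of this analogy can be illustrated with a small computation. The sketch below is a finite dimensional stand-in rather than the operator setting of the text: it assumes the standard second-difference matrix approximating $d^2/dz^2$ on $(0, 1)$ with zero boundary values, uses the Euclidean inner product in place of the $H^m(D)$ inner product, and takes $\alpha$ from the well-known closed form for the eigenvalues of that matrix. The grid size and all names are illustrative.

```python
import math
import random

N = 20                      # number of interior grid points (illustrative)
h = 1.0 / (N + 1)           # grid spacing on (0, 1)

def apply_A(x):
    """Second-difference approximation of d^2/dz^2 with zero boundary values."""
    y = []
    for i in range(N):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < N - 1 else 0.0
        y.append((left - 2.0 * x[i] + right) / (h * h))
    return y

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# The eigenvalues are -(4/h^2) sin^2(k*pi*h/2), k = 1..N; -alpha is the largest.
alpha = (4.0 / (h * h)) * math.sin(math.pi * h / 2.0) ** 2

# <Ax, x> + alpha*||x||^2 should be non-positive for every x.
random.seed(0)
samples = [[random.uniform(-1.0, 1.0) for _ in range(N)] for _ in range(200)]
worst_gap = max(dot(x, apply_A(x)) + alpha * dot(x, x) for x in samples)

# The bound is attained by the first eigenvector, sampled from sin(pi*z).
first_mode = [math.sin(math.pi * (i + 1) * h) for i in range(N)]
tight_gap = dot(first_mode, apply_A(first_mode)) + alpha * dot(first_mode, first_mode)
print(alpha, worst_gap, tight_gap)
```

Here `alpha` is close to $\pi^2$, every sampled vector satisfies the negative definiteness inequality, and the first mode satisfies it with equality up to rounding.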

The condition for strong ellipticity is not as stringent, and a
strongly elliptic operator is defined by:

Definition 2.3: If $A$ is an elliptic differential operator of even
order 2p, then $A$ is said to be strongly elliptic if the inequality

$$(-1)^p \sum_{|q|=2p} a_q(z)\,\xi^q \le -\alpha \sum_{|q|=2p} \xi^q \tag{2.3.4}$$

is satisfied for some $\alpha > 0$, and for all $\xi \in R^n$ and $z \in D$.

Note that, unlike in Inequality 2.3.3 for the coercive operator case,
the summation in Inequality 2.3.4 is taken over only the highest order
terms of the operator $A$. All of the terms of a given order in the coercive
operator case must satisfy this type of inequality. Thus, coercivity
implies strong ellipticity, but the converse does not hold.

To illustrate coercive and strongly elliptic operators, let us
consider the second order differential operator defined on some subset $D$
of $R^2$



The coefficients of this operator satisfy Inequality 2.3.3 for k=0 and
k=1 if we choose $\alpha = 1$, implying that this operator is coercive (and, of
course, strongly elliptic). If, on the other hand, we consider the
differential operator


$$A_2 = \cdots + 2$$


we see that Inequality 2.3.3 is satisfied for k=1, but is not satisfied
for k=0, and, thus, the operator is not coercive. However,
Inequality 2.3.4 is satisfied, which implies that $A_2$ is a strongly elliptic
operator.
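The difference between the two definitions can be sketched in code for the simplest setting. The example below assumes a hypothetical one-dimensional, constant-coefficient operator $A = a_2\, d^2/dz^2 + a_0$ (so $n = 1$, $p = 1$), for which Inequality 2.3.3 reduces to $a_0 \le -\alpha$ at $k = 0$ and $a_2 \ge \alpha$ at $k = 1$, while Inequality 2.3.4 constrains $a_2$ alone. The two operators tested are stand-ins chosen to mirror the behavior described above, not the operators of the text, and the function names are illustrative.

```python
def coercivity_constant(a0, a2):
    """Largest alpha > 0 satisfying Inequality 2.3.3 for k = 0 and k = 1, or None.

    Hypothetical 1-D operator A = a2 * d^2/dz^2 + a0 with constant coefficients:
      k = 0:  a0 <= -alpha
      k = 1:  -a2 * xi**2 <= -alpha * xi**2 for all real xi, i.e. a2 >= alpha
    """
    alpha = min(-a0, a2)
    return alpha if alpha > 0 else None

def is_strongly_elliptic(a2):
    """Inequality 2.3.4 with p = 1 only constrains the highest-order coefficient."""
    return a2 > 0

# d^2/dz^2 - 1: both conditions hold with alpha = 1.
print(coercivity_constant(-1.0, 1.0), is_strongly_elliptic(1.0))
# d^2/dz^2 + 2: the k = 0 condition fails, yet the operator is strongly elliptic.
print(coercivity_constant(2.0, 1.0), is_strongly_elliptic(1.0))
```

A positive zeroth-order coefficient is enough to destroy coercivity while leaving strong ellipticity untouched, which is the distinction the two examples are meant to convey.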

The fact that we do not consider more general operators than 
those described above is a reflection of the state of knowledge con- 
cerning differential operators and the fact that there are many physical 
systems of interest whose mathematical models have spatial differential 
operators falling within these categories. 

2.4 SYSTEM EQUATIONS -- PARABOLIC AND HYPERBOLIC

The purpose of this section is to tie together the concepts discussed
in the preceding two sections -- namely, state, state space, and
system differential operators -- and arrive at a description of a
distributed parameter system in the form of one or more partial
differential equations. This, of course, is in direct analogy with the
equations of state in finite dimensional systems. The only ingredient
missing up to now is the time variable.

Let us consider functions $x(t)$ defined on $t \in [0,T]$ and having
values in the Sobolev space $H^m(D)$, defined in Section 2 of this chapter,
that is, $x(t) \in H^m(D)$ $\forall t \in [0,T]$. Just as was done in Section 2, these
vector functions $x(t)$ may be considered as points of a function space.
Since emphasis has been placed on considering Hilbert spaces as state
spaces, the space $L^2(0,T;H^m(D))$ is defined:

Definition 2.4: If $x(t) \in H^m(D)$ for all $t \in [0,T]$, then the square
integrable Sobolev space-valued functions are

$$L^2(0,T;H^m(D)) = \Big\{ x(\cdot) : x(t) \in H^m(D)\ \forall t \in [0,T] \ \text{ and } \ \int_0^T \|x(t)\|^2_{H^m(D)}\,dt < \infty \Big\}$$

Note that this is a Hilbert space with inner product

$$\langle x(\cdot), y(\cdot)\rangle_{L^2(0,T;H^m(D))} = \int_0^T \langle x(t), y(t)\rangle_{H^m(D)}\,dt$$

Since it is desired to represent physical distributed parameter
systems, it is essential to be able to characterize partial differentiation
with respect to time. With the discussion of the distribution theoretic
results in Section 2, the tools are on hand to make this a straightforward
procedure. If we consider the space of infinitely differentiable Sobolev
space-valued functions with compact support in [0,T] and its
corresponding dual space of distributions, which, for convenience, may be
denoted by $\mathcal{D}'[0,T]$, then the following Sobolev space of Sobolev
space-valued functions may be defined (see Ref. 15, p. 115).


Definition 2.5: W(0,T) is the set of Sobolev space-valued functions
defined on [0,T] with the property

$$W(0,T) = \Big\{ x(\cdot) : x(\cdot) \in L^2(0,T;H^m(D));\ \frac{d}{dt}x(\cdot) \in L^2(0,T;H^m(D)) \Big\}$$


This, as might be expected, is a Hilbert space with inner product

$$\langle x(\cdot), y(\cdot)\rangle_{W(0,T)} = \langle x(\cdot), y(\cdot)\rangle_{L^2(0,T;H^m(D))} + \Big\langle \frac{dx(\cdot)}{dt}, \frac{dy(\cdot)}{dt}\Big\rangle_{L^2(0,T;H^m(D))}$$

We are now in a position to describe partial differential equations
by ordinary differential equations in Sobolev space-valued functions.
Two types of partial differential equations are considered -- parabolic
and hyperbolic. Parabolic equations are of the form:

$$\frac{\partial x(t,z)}{\partial t} = A x(t,z) + f(t,z) \qquad (2.4.1)$$

where A is an elliptic partial differential operator in the spatial
variable z as described in Section 3. If $x(t,z)$, $t \in (0,T]$, $z \in D$, is
assumed to be the element $x(t) \in W(0,T)$, Eq. 2.4.1 has the equivalent
formulation as the ordinary differential equation in $L^2(0,T;H^m(D))$

$$\frac{d}{dt}x(t) = A x(t) + f(t) \qquad (2.4.2)$$

where $f(\cdot) \in L^2(0,T;L^2(D))$.

As might be expected from knowledge of the finite-dimensional problem,
an initial condition must be given so as to specify an exact solution
of Eq. 2.4.2. If the initial data is given by $x(0,z) = x_0(z)$, where
$x_0(z)$ has the representation $x_0 \in H^m(D)$, then Eq. 2.4.2 has the initial
condition

$$x(0) = x_0 \qquad (2.4.3)$$


As an example of a parabolic equation, consider the single degree of
freedom heat diffusion equation

$$\frac{\partial x(t,z)}{\partial t} = \mu\,\frac{\partial^2 x(t,z)}{\partial z^2}, \qquad \mu = \text{constant}$$

where, of course, the operator A is $\mu\,\partial^2/\partial z^2$, an elliptic operator, and
$x(t,z)$ is a temperature distribution.

* This description of distributed parameter systems is, of course, not
complete without the specification of boundary conditions, which will
be discussed in the next section.

Before discussing hyperbolic equations it is necessary to extend
the previously defined state space to a two-dimensional form
consisting of column vectors $(x_1(t), x_2(t))^T$ where, for each value of
$t \in [0,T]$, $x_i(t) \in H^m(D)$, i = 1, 2. A more general spatial differential
operator must also be defined, namely, a 2 x 2 matrix, the elements of
which are spatial differential operators as described in Section 2.3.

The particular matrix operator to be discussed is:

$$\mathcal{A} = \begin{bmatrix} 0 & I \\ A & 0 \end{bmatrix} \qquad (2.4.4)$$

where A is as defined above and I is the identity operator on $H^m(D)$.

We are now in a position to describe second order hyperbolic
equations in terms of the state variables and state spaces of Section 2.2.
Hyperbolic equations are of the form

$$\frac{\partial^2 x(t,z)}{\partial t^2} = A x(t,z) + f(t,z) \qquad (2.4.5)$$

where A is elliptic. If $x(t,z)$ and $\partial x(t,z)/\partial t$, $t \in [0,T]$, $z \in D$, are elements
of W(0,T), Eq. 2.4.5 has the first order vector ordinary differential
equation representation:


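$$\frac{d}{dt}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} = \mathcal{A}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} + \begin{bmatrix} 0 \\ f(t) \end{bmatrix} \qquad (2.4.6)$$

where $x_1(t) = x(t)$, $x_2(t) = dx(t)/dt$, and $f(\cdot)$ is assumed to be an
element of $L^2(0,T;H^m(D))$.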

Once again, initial conditions are required and this time they
take the form of a 2-vector

$$\underline{x}(0) = \begin{bmatrix} x_1(0) \\ x_2(0) \end{bmatrix} = \begin{bmatrix} x_0 \\ \dot{x}_0 \end{bmatrix} = \underline{x}_0 \qquad (2.4.7)$$

with $x_0$ and $\dot{x}_0$ each being elements of $H^m(D)$ and representing the
initial data $x(0,z)$ and $\frac{\partial}{\partial t}x(t,z)\big|_{t=0}$, respectively.

An example of a hyperbolic equation is the equation which governs
the magnitude of transverse vibrations in a beam

$$\frac{\partial^2 x(t,z)}{\partial t^2} = \mu\,\frac{\partial^2 x(t,z)}{\partial z^2}, \qquad \mu = \text{constant}$$

where $x(t,z)$ is the transverse deflection of the point z in the beam at
time t. The operator A is again the elliptic operator $\mu\,\partial^2/\partial z^2$. It should
be stressed that the operator A is elliptic in both parabolic and
hyperbolic equations.

These two classes of partial differential equations, though not
general enough to describe all linear distributed parameter systems,
describe a great number of physical systems, and, such being the case,
are worthy of being the equations of state considered in a system
theoretic and, subsequently, control theoretic development. All of the
elements analogous to system description in lumped parameter systems --
namely, state, state space, system operator, and state equation -- have
been introduced. One subject, boundary conditions, which is indigenous
to distributed, but not to lumped, parameter systems, remains to be
discussed in Section 5.


2.5 BOUNDARY CONDITIONS

This section is devoted to the discussion of boundary conditions to
partial differential equations. This is a slight deviation from the stated
purpose of this chapter -- the development of a system theoretic
approach parallel to that of finite dimensional systems -- but one which is
necessary for the sake of completeness. It will be shown that boundary
conditions can be treated within the framework of the system theoretic
notions developed in the preceding sections. Dirichlet and Neumann
boundary conditions will be defined and, for the case of Dirichlet
boundary conditions, the compatibility with the state space conditions
already given will be demonstrated in detail.

If the differential operator A in either the parabolic system
(2.4.2) and (2.4.3) or the hyperbolic system (2.4.6) and (2.4.7) is
of order m, then the Dirichlet boundary conditions, defined on the
boundary $\partial D$ of the region D, are

$$x(t)\Big|_{\partial D} = \frac{\partial x(t)}{\partial n}\Big|_{\partial D} = \cdots = \frac{\partial^{m-1} x(t)}{\partial n^{m-1}}\Big|_{\partial D} = 0 \quad \forall t \in [0,T] \qquad (2.5.1)$$

where n denotes the normal to the boundary $\partial D$ and $\partial^k/\partial n^k$ is the k-th
derivative normal to, and directed to the exterior of, the boundary.

As an example of Dirichlet boundary conditions, let us consider the
heat equation defined on the unit disk in $R^2$, that is, the equation

$$\frac{\partial x(t,z)}{\partial t} = \mu\left[\frac{\partial^2 x(t,z)}{\partial z_1^2} + \frac{\partial^2 x(t,z)}{\partial z_2^2}\right], \qquad \mu = \text{constant}$$

where the spatial variable z is the vector $(z_1, z_2)$ and the spatial
domain D is

$$D = \{ z \in R^2 : z_1^2 + z_2^2 < 1 \}$$

The boundary $\partial D$ is, of course,

$$\partial D = \{ z \in R^2 : z_1^2 + z_2^2 = 1 \}$$

The Dirichlet boundary conditions tell us that

$$x(t,z)\Big|_{z \in \partial D} = 0$$

or, the temperature on the circle $z_1^2 + z_2^2 = 1$ is required to be 0.
Moreover, since the order of the system, m, is 2, we have the
remaining condition

$$\frac{\partial x(t,z)}{\partial n}\Big|_{z \in \partial D} = \frac{\partial x(t,z)}{\partial z_1}\Big|_{z \in \partial D}\cos\theta + \frac{\partial x(t,z)}{\partial z_2}\Big|_{z \in \partial D}\sin\theta = 0, \quad \forall \theta \in [0, 2\pi]$$

which tells us that the component of the gradient normal to the unit
circle must be 0, i.e., no heat flow outward through the boundary.

In order to incorporate this within the framework of the theory
discussed in Section 2.2, we must first develop the concept of Sobolev
spaces of negative and fractional orders. The Sobolev space of
negative order $H^{-m}(D)$ can simply be looked upon as the dual space
of the Sobolev space of positive order $H_0^m(D)$, or $(H_0^m(D))' = H^{-m}(D)$.
Fractional order Sobolev spaces are defined by means of Fourier
analysis. If z is the spatial variable, which is an element of $R^n$, then
the Fourier transform of $x(z)$ is

$$\mathcal{F}x(\xi) = \int_D \exp(-2\pi j(\xi \cdot z))\,x(z)\,dz \qquad (2.5.2)$$

where $(\xi \cdot z)$ is the usual vector inner product on $R^n$. It can be shown
that the Fourier transform of the differential operator $D^q$ operating
on $x(z)$, $\mathcal{F}D^q x(\xi)$, is of the form

$$\mathcal{F}D^q x(\xi) = (2\pi j)^{|q|}\,\xi^q\,\mathcal{F}x(\xi) \quad \forall x \in L^2(D) \qquad (2.5.3)$$

where $\xi^q$ is the product defined in Section 2. This results in an
alternative definition of the Sobolev space $H^m(D)$, namely

$$H^m(D) = \{ x : \xi^q\,\mathcal{F}x \in L^2(D)\ \forall q \text{ with } |q| \le m \}$$

or, equivalently,

$$H^m(D) = \{ x : (1+|\xi|^2)^{m/2}\,\mathcal{F}x(\xi) \in L^2(D) \} \qquad (2.5.4)$$

There is no restriction in allowing m to be any real number in
Expression (2.5.4), rather than requiring it to be a whole number as in
Section 2. Thus, we have arrived at the specification of fractional
order Sobolev spaces. These are again Hilbert spaces with inner
product given by

$$(x, y)_{H^m(D)} = \big( (1+|\xi|^2)^{m/2}\,\mathcal{F}x,\ (1+|\xi|^2)^{m/2}\,\mathcal{F}y \big)_{L^2(D)}$$

The theorem of the trace, stated and proved by Lions and Magenes,
yields the information that the normal derivatives
$\frac{\partial^k x}{\partial n^k}\big|_{\partial D}$ given in Eq. 2.5.1 are elements of the fractional
Sobolev spaces $H^{m-k-1/2}(\partial D)$, $0 \le k \le m-1$, if $x \in H^m(D)$, and the
transformation

$$x \rightarrow \Big\{ \frac{\partial^k x}{\partial n^k}\Big|_{\partial D},\ 0 \le k \le m-1 \Big\}$$

is a continuous linear surjection of $H^m(D)$ onto the product space
$\prod_{k=0}^{m-1} H^{m-k-1/2}(\partial D)$. The kernel of this transformation, that is, the
space of $x \in H^m(D)$ for which

$$x\Big|_{\partial D} = \frac{\partial x}{\partial n}\Big|_{\partial D} = \cdots = \frac{\partial^{m-1} x}{\partial n^{m-1}}\Big|_{\partial D} = 0,$$

is the closure of the space of infinitely differentiable functions of
compact support in D, $C_0^\infty(D)$, in the norm of $H^m(D)$. Dunford and Schwartz
(see Ref. 2.1, p. 1652) denote this closure as $H_0^m(D)$, so that we have
shown that we can represent this closure of $C_0^\infty(D)$ in the following
manner:

$$H_0^m(D) = \Big\{ x \in H^m(D) : \frac{\partial^k x}{\partial n^k}\Big|_{\partial D} = 0,\ 0 \le k \le m-1 \Big\} \qquad (2.5.5)$$

Since $H_0^m(D)$ is a closed subspace of $H^m(D)$, and therefore a Hilbert
space (with the inner product of $H^m(D)$), it may just as easily be
considered as a candidate for a state space, in the sense of Section 2.2, as
$H^m(D)$. Thus, the additional consideration of Dirichlet boundary
conditions does not divert our course from that of developing a system
theory analogous to that of finite dimensional systems.

The Neumann boundary value problem is associated with a second
order elliptic operator of the form

$$A = \sum_{i=1}^{n}\sum_{j=1}^{n} \frac{\partial}{\partial z_i}\Big( a_{ij}(z)\,\frac{\partial}{\partial z_j} \Big) + a_0(z) \qquad (2.5.6)$$

with the coercivity property, Inequality 2.3.3, requiring that there exist
an $a > 0$ such that

$$\sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij}(z)\,\xi_i\,\xi_j \ge a\,(\xi_1^2 + \xi_2^2 + \cdots + \xi_n^2) \quad \text{for all } \xi \in R^n$$

and $a_0(z) \ge a > 0$ for all $z \in D$.

The Neumann boundary condition relative to A is

$$\frac{\partial x}{\partial \nu_A}\Big|_{\partial D} = g$$

where

$$\frac{\partial x}{\partial \nu_A} = \sum_{i=1}^{n}\sum_{j=1}^{n} a_{ij}(z)\,\cos(n, z_i)\,\frac{\partial x}{\partial z_j},$$

n is the normal to the exterior of $\partial D$ at $z \in \partial D$, and therefore
$\cos(n, z_i)$ is the i-th direction cosine; g is a specified function.
Since, by the theorem of the trace discussed above for the Dirichlet
problem, $\frac{\partial x}{\partial \nu_A}\big|_{\partial D}$ must be an element of $H^{-1/2}(\partial D)$, so must it be
true that $g \in H^{-1/2}(\partial D)$. In this case the kernel of the transformation
$x \rightarrow \frac{\partial x}{\partial \nu_A}\big|_{\partial D} - g$ is not so readily identifiable as was the case
for the Dirichlet transformation; however, direct use of this kernel
itself will not cause too many analytical stumbling blocks.

Ellipticity of the system operator for both types of boundary
conditions is required to prove existence and uniqueness of solutions for
either parabolic or hyperbolic systems. The property of
strong ellipticity will be used to derive a very useful system theoretic
result in Section 2.7 and optimization results in Chapter IV.

2.6 SEMIGROUPS OF OPERATORS

This section and the final two sections of this chapter are devoted
to semigroups of operators and the systems which generate them.
They will complete the system theoretic description of distributed
parameter systems by giving the distributed parameter analog to transition
matrices and variation of constants formulae of finite dimensional
system theory. In this section we shall consider semigroups of operators
defined on a general Banach space $\mathcal{X}$ with range in $\mathcal{X}$. It will be
useful in the sequel to consider these operators as elements of a space
of operators, $\mathcal{L}(\mathcal{X})$, the space of endomorphisms on the Banach
space $\mathcal{X}$. Let us make the following definition:


Definition 2.6: A mapping $\Phi(t) : [0,\infty) \rightarrow \mathcal{L}(\mathcal{X})$, denoted by
$\{\Phi(t)\}_{t \in [0,\infty)}$, is called a one-parameter semigroup of endomorphisms
with parameter $t \in [0,\infty)$, if for all $t_1, t_2 \in [0,\infty)$

$$\Phi(t_1 + t_2) = \Phi(t_1)\,\Phi(t_2) \qquad (2.6.1)$$

Equation 2.6.1 is called the semigroup property, and the set of
operators $\{\Phi(t)\}_{t \in [0,\infty)}$ with the semigroup property will be referred to as
a semigroup of operators as a matter of convenience.

Two different types of semigroups of operators may be considered,
depending on the manner in which $\Phi(t)$ converges as t approaches
zero. The convergence may be uniform in the operator
topology of $\mathcal{L}(\mathcal{X})$, or more specifically, $\lim_{t \to 0} \|\Phi(t) - \Phi(0)\| = 0$, where
the norm is the usual induced operator norm on $\mathcal{X}$. For this case of
uniform convergence the procedure of characterizing the semigroup of
operators is quite straightforward and stands as a direct analogy to the
description of the matrix $e^{At}$ in finite dimensional systems. The other
type of semigroup to be considered is one in which the convergence as
t approaches zero is strong, or $\lim_{t \to 0} \|\Phi(t)x - \Phi(0)x\| = 0$ $\forall x \in \mathcal{X}$.
With strong convergence the analysis is much less straightforward.

The Banach space of endomorphisms on $\mathcal{X}$, $\mathcal{L}(\mathcal{X})$, is a Banach
algebra, and Hille and Phillips (Ref. 21, p. 283) show that for any
Banach algebra $\mathfrak{B}$ and uniformly continuous $f(t) : [0,\infty) \rightarrow \mathfrak{B}$ such that

$$f(t_1 + t_2) = f(t_1)\,f(t_2), \quad \text{for } t_1, t_2 \in [0,\infty)$$

then $f(t)$ must be of the form

$$f(t) = I + \sum_{n=1}^{\infty} \frac{t^n a^n}{n!} \qquad (2.6.2)$$

where I is the unit (identity) element of the Banach algebra $\mathfrak{B}$ and
a is some unique element of $\mathfrak{B}$. The series is absolutely convergent
for all $t \in [0,\infty)$. This result can be specialized, of course, to the Banach
algebra of prime interest, namely, $\mathcal{L}(\mathcal{X})$. Any uniformly
convergent semigroup of operators $\{\Phi(t)\}_{t \in [0,\infty)}$ can be represented by
the expression

$$\Phi(t) = \exp(tA) \qquad (2.6.3)$$

where A is a bounded operator in $\mathcal{L}(\mathcal{X})$ and the exponential
expression follows from Eq. 2.6.2.
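In finite dimensions the representation (2.6.3) can be tried out directly. The sketch below is an illustration only: it assumes $\mathcal{X} = R^2$, picks an arbitrary 2 x 2 matrix A (a hypothetical choice with eigenvalues -1 and -2), sums the series of Eq. 2.6.2 to a fixed number of terms, and compares the result with the semigroup property (2.6.1).

```python
import math


def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def mat_mul(A, B):
    n, m, p = len(A), len(B), len(B[0])
    return [[sum(A[i][k] * B[k][j] for k in range(m)) for j in range(p)]
            for i in range(n)]


def mat_exp(A, t, terms=40):
    """Partial sum I + sum_{k=1}^{terms} (tA)^k / k! of the series in Eq. 2.6.2."""
    n = len(A)
    result = identity(n)
    term = identity(n)
    for k in range(1, terms + 1):
        scaled = [[t * a / k for a in row] for row in A]
        term = mat_mul(term, scaled)  # term now equals (tA)^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result


def max_abs_diff(A, B):
    return max(abs(a - b) for ra, rb in zip(A, B) for a, b in zip(ra, rb))


def closed_form_entry(t):
    """Upper-left entry of exp(tA) for the matrix A below, worked out by hand."""
    return 2.0 * math.exp(-t) - math.exp(-2.0 * t)


# Illustrative bounded operator on R^2 (an assumption, not from the text).
A = [[0.0, 1.0], [-2.0, -3.0]]

# Semigroup property: Phi(t1 + t2) should match Phi(t1) Phi(t2).
gap = max_abs_diff(mat_exp(A, 1.2), mat_mul(mat_exp(A, 0.5), mat_exp(A, 0.7)))
```

The truncation at 40 terms is adequate only because t times the norm of A is small here; for a bounded operator of large norm more terms, or a scaling-and-squaring scheme, would be needed.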

An important relation exists between the resolvent of the operator
A and the Laplace transformation of the semigroup. The
resolvent is the operator $R(\lambda;A) = (\lambda I - A)^{-1}$ defined for all values of
$\lambda$ for which the inverse exists. It can be shown (see, for example,
Ref. 21, p. 338) that the resolvent operator is the Laplace transform
of the semigroup operator

$$R(\lambda;A) = \int_0^\infty e^{-\lambda t}\,\Phi(t)\,dt$$

for all $\lambda$ with $|\lambda| > \|A\|$, and, moreover, as might be expected from
knowledge of Laplace transform theory,

$$\Phi(t) = \frac{1}{2\pi j}\int_\Gamma e^{\lambda t}\,R(\lambda;A)\,d\lambda$$

where $\Gamma$ is a closed path surrounding the spectrum of A in the
clockwise sense.
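The first of these relations is easy to test in the simplest possible case. The sketch below assumes a scalar "operator" a, so that $\Phi(t) = e^{at}$ and $R(\lambda;a) = 1/(\lambda - a)$, and approximates the Laplace integral by Simpson's rule on a finite horizon; the horizon and step count are arbitrary choices made for the illustration.

```python
import math


def simpson(f, a, b, n):
    """Composite Simpson rule with n subintervals (n is rounded up to even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def semigroup(a, t):
    """Scalar semigroup Phi(t) = exp(a t)."""
    return math.exp(a * t)


def laplace_of_semigroup(a, lam, horizon=40.0, steps=20000):
    """Approximate the integral of exp(-lam t) Phi(t) over [0, horizon]."""
    return simpson(lambda t: math.exp(-lam * t) * semigroup(a, t),
                   0.0, horizon, steps)


def resolvent(a, lam):
    """R(lam; a) = (lam - a)^(-1) for a scalar a."""
    return 1.0 / (lam - a)
```

For example, `laplace_of_semigroup(-0.5, 2.0)` and `resolvent(-0.5, 2.0)` should agree to many digits, since the integrand has decayed to a negligible size well before the horizon.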

The operator A is called the infinitesimal generator of the
semigroup and special note should be taken of the fact that it is bounded. It
is also important to note that every bounded operator in $\mathcal{L}(\mathcal{X})$ is the
infinitesimal generator of a uniformly convergent semigroup of
operators. This leads to the conclusion that unbounded (or, more
particularly, differential) operators do not generate uniformly convergent
semigroups of operators, so that attention naturally becomes focused on
strongly convergent semigroups of operators.

In order to characterize strongly convergent semigroups of
operators, we first make the following definition:

Definition 2.7: The infinitesimal operator $A_0$ of a semigroup
$\{\Phi(t)\}_{t \in [0,\infty)}$ is defined by

$$A_0 x = \lim_{\eta \to 0} A_\eta x \qquad (2.6.4)$$

where $A_\eta = \frac{1}{\eta}[\Phi(\eta) - I]$, whenever the limit in (2.6.4) exists.

The set of $x \in \mathcal{X}$ for which the limit exists is simply called the domain
of $A_0$, $Do(A_0)$. We would ideally like to achieve an exponential
characterization of the semigroup as in the uniform case, but in this case the
candidate for infinitesimal generator, $A_0$, is not bounded (the domain of
$A_0$ is not necessarily all of $\mathcal{X}$) so that an exponential expression
involving $A_0$ would be meaningless.* Aid in this dilemma lies in the
fact that the operators $A_\eta$ given in Definition 2.7 are bounded
operators so that we might expect the exponential solution we desire to be
some kind of limit of exponential expressions involving the $A_\eta$'s. It
is shown (see Ref. 22, p. 401) that a limiting exponential solution does
exist, namely

$$\Phi(t)x = \lim_{\eta \to 0} \exp(tA_\eta)\,x \quad \forall x \in Do(A_0) \qquad (2.6.5)$$

where the convergence is uniform with respect to t in every finite
interval [0, s]. So every strongly convergent semigroup has the
characterization (2.6.5).
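The limit (2.6.5) can be watched numerically in a scalar setting. The sketch below assumes $\Phi(t) = e^{at}$ with a = -3 (an arbitrary illustrative value), forms the difference quotient $A_\eta$ of Definition 2.7, and records the largest gap between $\exp(tA_\eta)$ and $\Phi(t)$ over a grid on [0, s]. In this scalar case the generator is of course bounded, so the example only illustrates the mechanics of the limit, not the unbounded situation the text is concerned with.

```python
import math


def phi(a, t):
    """Scalar semigroup Phi(t) = exp(a t)."""
    return math.exp(a * t)


def a_eta(a, eta):
    """Difference quotient A_eta = (Phi(eta) - I) / eta, computed via expm1."""
    return math.expm1(a * eta) / eta


def max_error(a, eta, s=2.0, samples=201):
    """Largest gap between exp(t A_eta) and Phi(t) over a grid on [0, s]."""
    worst = 0.0
    for k in range(samples):
        t = s * k / (samples - 1)
        worst = max(worst, abs(math.exp(t * a_eta(a, eta)) - phi(a, t)))
    return worst


# The error should shrink as eta decreases, uniformly over the interval.
errors = [max_error(-3.0, eta) for eta in (1e-1, 1e-2, 1e-3, 1e-4)]
```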

The most important question of all, at least for our purposes, is
under what conditions will an unbounded operator A be the infinitesimal
generator of a strongly convergent semigroup of operators? The
Hille-Yosida theorem (Ref. 21, p. 363) tells us that a sufficient condition for a
closed linear operator A to be the infinitesimal generator of a
semigroup $\{\Phi(t)\}_{t \in [0,\infty)}$ such that $\|\Phi(t)\| \le M$ is that the domain of A be
dense in $\mathcal{X}$ and the following inequality holds**

$$\|(\lambda I - A)^{-n}\| \le M\lambda^{-n} \quad \text{for } \lambda > 0 \text{ and } n = 1, 2, 3, \ldots \qquad (2.6.6)$$

* Despite this, we shall use infinitesimal generator and infinitesimal
operator interchangeably.

** The Inequality 2.6.6 is a sufficient condition for the inverse Laplace
transform of $R(\lambda;A)$ to exist. This inverse transform is the
semigroup $\{\Phi(t)\}_{t \in [0,\infty)}$.

We now have the tools to determine whether the spatial
differential operators of Section 3 of this chapter are infinitesimal
generators of semigroups. This is the direct concern of the next section.

2.7 DIFFERENTIAL OPERATORS AS INFINITESIMAL
GENERATORS OF SEMIGROUPS

This section relates what has been stated about semigroups of
operators with the characterization of solutions to partial differential
equations. For systems containing elliptic operators of even order,
Dunford and Schwartz (Ref. 18) establish the connection by giving the necessary
conditions for the differential operator to be the infinitesimal generator
of a semigroup of operators and showing that the solution of the
unforced parabolic equation associated with the spatial differential
operator at time t is simply the operator $\Phi(t)$ operating on the initial data.

To qualify for infinitesimal generator the system operator A,
given by Eq. 2.3.1, must satisfy a condition which is a slight
modification of the condition for strong ellipticity given in Section 2.3, namely,

$$(-1)^{m/2} \sum_{|q|=m} a_q(z)\,\xi^q \le 0 \quad \text{for all } \xi \in R^n,\ z \in D \qquad (2.7.1)$$

Additional restrictions must be placed on the state space to be
considered. First, let us define two restrictions $A_1$ and $A_2$ of the
operator A which have the following properties:

$$Do(A_1) = C_0^\infty(D); \quad A_1 x = Ax \ \ \forall x \in C_0^\infty(D)$$
$$Do(A_2) = H^m(D); \quad A_2 x = Ax \ \ \forall x \in H^m(D) \qquad (2.7.2)$$

We now define the extension $A_3$ of $A_1$ which has the property

$$Do(A_3) = H_0^{m/2}(D) \cap H^m(D); \quad A_3 x = A_2 x \ \ \forall x \in Do(A_3) \qquad (2.7.3)$$

where $H_0^{m/2}(D)$ is specified in Section 5. Note that the problem of
Dirichlet has entered with the introduction of the space $H_0^{m/2}(D)$.

With these assumptions on the operator A and on the state space
to be considered, Dunford and Schwartz (Ref. 18) prove a theorem, stated in
detail in Appendix A, which, in summary, yields the following results:

(1) $A_3$ is the infinitesimal generator of a semigroup of bounded
operators $\{\Phi(t)\}_{t \in [0,\infty)}$

(2) If $x_0 \in Do(A_3)$ is the initial condition for the equation
$\dot{x} = Ax$, then the solution is $x(t) = \Phi(t)x_0$

It is clear that the differential operator $\partial^2/\partial z^2$, which is the
system operator for both the one-dimensional heat equation and the
transverse beam vibration equation, satisfies Inequality 2.7.1, and,
therefore, by the result of this section, is an operator having a restriction
$A_3$ which is the infinitesimal generator of a semigroup of operators.

Note that the condition for strong ellipticity, Inequality 2.3.4, is
the condition for strict inequality in Expression 2.7.1. If $\{\Phi(t)\}_{t \in [0,\infty)}$
is the semigroup of operators generated by a strongly elliptic operator,
following the above procedure, then the bounded operator $\Phi(t)$ has the
exponential bound

$$\|\Phi(t)\| \le M e^{-\lambda t}$$

where M and $\lambda$ are positive constants.

Thus, we are able to characterize the solutions to unforced partial
differential equations with the aid of a distributed parameter equivalent
of the finite dimensional transition matrix. It remains to characterize
solutions of the forced partial differential equation.
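For the one-dimensional heat equation the semigroup can be written down explicitly in terms of eigenfunctions, which gives a concrete picture of the results of this section. The sketch below rests on several assumptions that are not made in the text: the domain is D = (0, 1) with zero boundary values, the state is represented by its coefficients with respect to the eigenfunctions $\sqrt{2}\sin(n\pi z)$, only finitely many coefficients are kept, and norms are taken in $L^2(D)$ rather than in a Sobolev space.

```python
import math


def heat_semigroup(coeffs, t, mu=1.0):
    """Apply Phi(t) to a state given by its sine-series coefficients.

    Mode n (n = 1, 2, ...) is an eigenfunction of mu d^2/dz^2 with
    eigenvalue -mu (n pi)^2, so Phi(t) multiplies it by exp(-mu (n pi)^2 t).
    """
    return [c * math.exp(-mu * (n * math.pi) ** 2 * t)
            for n, c in enumerate(coeffs, start=1)]


def l2_norm(coeffs):
    """L^2 norm of the state; by orthonormality this is the coefficient norm."""
    return math.sqrt(sum(c * c for c in coeffs))


def decay_bound(t, mu=1.0):
    """Exponential bound M exp(-lambda t) with M = 1 and lambda = mu pi^2."""
    return math.exp(-mu * math.pi ** 2 * t)


x0 = [1.0, -0.5, 0.25, 0.125]           # illustrative initial coefficients
direct = heat_semigroup(x0, 0.3)        # Phi(0.3) x0
composed = heat_semigroup(heat_semigroup(x0, 0.1), 0.2)  # Phi(0.2) Phi(0.1) x0
```

Here `direct` and `composed` should coincide up to rounding, which is the semigroup property, and the norm of `direct` should not exceed `decay_bound(0.3)` times the norm of `x0`.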

2.8 VARIATION OF CONSTANTS FORMULA FOR
DISTRIBUTED PARAMETER SYSTEMS

We are now in a position to characterize solutions of distributed
parameter systems described by forced parabolic and hyperbolic partial
differential equations. This characterization will take an analogous
form to the variation of constants formula familiar in finite dimensional
system theory and will complete the system theoretic description for
distributed parameter systems.

Phillips (Ref. 23) proves a theorem, stated specifically in Appendix B,
which yields a variation of constants formula for the parabolic system
described by Eqs. 2.4.2 and 2.4.3. The necessary assumption is that
the system operator A is the infinitesimal generator of a semigroup
of operators $\{\Phi(t)\}_{t \in [0,\infty)}$ as described in the preceding section. The
result is that the solution of the equation

$$\dot{x}(t) = Ax(t) + f(t); \qquad x(0) = x_0$$

is

$$x(t) = \Phi(t)x_0 + \int_0^t \Phi(t-\sigma)\,f(\sigma)\,d\sigma \qquad (2.8.1)$$
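Formula (2.8.1) can be exercised on a single mode of a parabolic system. The sketch below assumes a scalar generator $A = -\lambda$, so that $\Phi(t) = e^{-\lambda t}$, and a forcing term $f(t) = \sin t$ chosen only because the resulting integral has a simple closed form; the quadrature routine and all names are illustrative.

```python
import math


def simpson(f, a, b, n):
    """Composite Simpson rule with n subintervals (n is rounded up to even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def forcing(t):
    """Illustrative continuously differentiable forcing term."""
    return math.sin(t)


def variation_of_constants(lam, x0, t, steps=2000):
    """x(t) = Phi(t) x0 + integral of Phi(t - s) f(s) ds, Phi(t) = exp(-lam t)."""
    integral = simpson(lambda s: math.exp(-lam * (t - s)) * forcing(s),
                       0.0, t, steps)
    return math.exp(-lam * t) * x0 + integral


def closed_form(lam, x0, t):
    """Exact solution of x' = -lam x + sin t with x(0) = x0."""
    particular = (lam * math.sin(t) - math.cos(t)
                  + math.exp(-lam * t)) / (1.0 + lam ** 2)
    return math.exp(-lam * t) * x0 + particular
```

With, say, lam = 2.0, x0 = 1.5 and t = 2.0, the two functions should return the same value to within the quadrature error.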

The only requirement on $f(t)$ for this characterization to be valid is
that $f(t)$ be strongly continuously differentiable.* Of course, from the
arguments of Section 7, the initial condition $x_0$ must be in the domain of
the operator $A_3$ defined in that section.

* That is, $\lim_{h \to 0} \frac{1}{h}[f(t+h) - f(t)]$ exists in the strong topology of
$L^2(0,T;L^2(D))$.

A similar result can be achieved for forced hyperbolic systems
of the type represented by vector Eqs. 2.4.6 and 2.4.7. Fattorini
shows that the solution to this hyperbolic system can be written in a form
similar to that of Eq. 2.8.1 by first introducing the two strongly
continuous operator-valued functions $\Phi_1(t)$ and $\Phi_2(t)$. $\Phi_1(t)$ is the
operator function which is obtained by writing the solution $x(t)$ of Eq. 2.4.6
with f = 0 and with initial condition $\underline{x}_0 = \begin{bmatrix} x_0 \\ 0 \end{bmatrix}$ in the form

$$x(t) = \Phi_1(t)\,x_0 \qquad (2.8.3)$$

Let us denote the solution of Eq. 2.4.6 with f = 0 and with initial
condition $\underline{x}_0 = \begin{bmatrix} 0 \\ \dot{x}_0 \end{bmatrix}$ as $v(t)$ and write $v(t)$ in the form

$$v(t) = \Phi_2(t)\,\dot{x}_0 \qquad (2.8.4)$$

It is clearly seen that $\Phi_1(t)$ and $\Phi_2(t)$ are related as

$$\Phi_2(t)\,x_0 = \int_0^t \Phi_1(\sigma)\,x_0\,d\sigma$$

Now, if f is twice continuously differentiable and if $\underline{x}_0$ is 0, the
solution of (2.4.6) can be shown to be

$$x(t) = \int_0^t \Phi_2(t-\sigma)\,f(\sigma)\,d\sigma \qquad (2.8.5)$$

Combining (2.8.3), (2.8.4), and (2.8.5) with the general initial
condition given by (2.4.7), the solution of (2.4.6) is given by

$$x(t) = \Phi_1(t)\,x_0 + \Phi_2(t)\,\dot{x}_0 + \int_0^t \Phi_2(t-\sigma)\,f(\sigma)\,d\sigma \qquad (2.8.6)$$

It is quite reasonably asked whether the operator-valued functions $\Phi_1(t)$
and $\Phi_2(t)$ are semigroups or not, and, if so, how are they generated?
The answer is that they are not exactly semigroups, but are very
closely related to them. Fattorini proves that the resolvent of A
is not the Laplace transform of either $\Phi_1(t)$ or $\Phi_2(t)$, but the following
relation exists between the Laplace transforms of $\Phi_1(t)$ and $\Phi_2(t)$ and
the operator $R(\lambda^2;A) = (\lambda^2 I - A)^{-1}$:

$$\int_0^\infty e^{-\lambda t}\,\Phi_1(t)\,x\,dt = \lambda R(\lambda^2;A)\,x$$
$$\int_0^\infty e^{-\lambda t}\,\Phi_2(t)\,x\,dt = R(\lambda^2;A)\,x \qquad x \in Do(A_3) \qquad (2.8.7)$$

Moreover, there exist constants K and $\omega < \infty$ such that $\|\Phi_1(t)\| \le Ke^{\omega t}$,
$\|\Phi_2(t)\| \le Ke^{\omega t}$. The variable $\lambda^2$ appears in the resolvent expression
because we are dealing with a second order time derivative in the
system equation.
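The relations (2.8.7) and the integral relation between $\Phi_1$ and $\Phi_2$ can be checked on a single vibration mode. The sketch below assumes a scalar $A = -\omega^2$, for which $\Phi_1(t) = \cos\omega t$ and $\Phi_2(t) = \sin(\omega t)/\omega$; these closed forms, the finite integration horizon, and the function names are assumptions made for the illustration and are not taken from the text.

```python
import math


def simpson(f, a, b, n):
    """Composite Simpson rule with n subintervals (n is rounded up to even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0


def phi1(omega, t):
    """Response to an initial displacement for the mode x'' = -omega^2 x."""
    return math.cos(omega * t)


def phi2(omega, t):
    """Response to an initial velocity for the same mode."""
    return math.sin(omega * t) / omega


def resolvent(omega, s):
    """R(s; A) = (s - A)^(-1) with the scalar A = -omega^2."""
    return 1.0 / (s + omega ** 2)


def laplace(g, lam, horizon=60.0, steps=60000):
    """Approximate the Laplace transform of g at lam on a finite horizon."""
    return simpson(lambda t: math.exp(-lam * t) * g(t), 0.0, horizon, steps)


omega, lam = 2.0, 1.0
lhs1 = laplace(lambda t: phi1(omega, t), lam)   # compare with lam R(lam^2; A)
lhs2 = laplace(lambda t: phi2(omega, t), lam)   # compare with R(lam^2; A)
integral_of_phi1 = simpson(lambda s: phi1(omega, s), 0.0, 1.3, 200)
```

In this scalar case neither $\Phi_1$ nor $\Phi_2$ satisfies the semigroup property, for instance $\cos(\omega(t_1 + t_2))$ is not $\cos(\omega t_1)\cos(\omega t_2)$, which matches the remark that these functions are closely related to semigroups without being semigroups themselves.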

The value of having variation of constants formulae like Eqs. 2.8.1
and 2.8.6 does not lie in having exact specification of solutions to
partial differential equations, but in having a specific form of the
solution, which will become extremely useful in the optimization results
presented in Chapter IV.



CHAPTER III 


FORMULATION OF THE CONTROL PROBLEM 
3.1 INTRODUCTION 

This chapter is concerned with the mathematical description of 
the problems which will be solved in Chapters IV and V. For both 
parabolic and hyperbolic systems the state regulator problem will be 
introduced. The set of admissible controls will be defined and the 
quadratic cost criterion will be specified. This cost criterion will be 
shown to be analogous to the quadratic cost criterion customarily speci- 
fied for a finite dimensional system. In addition, the restriction of the 
class of controls to those which are applied at a finite number of points 
within the spatial domain is considered and the subsequent modifi- 
cation of the cost criterion will be specified. 

In Section 3.2 of this chapter precise descriptions of both the
system and the control space are given. This will correspond to the
state equation description for finite dimensional systems in the form
$\dot{x} = Ax + Bu$. Conditions on the distributed parameter analog of the
B matrix are specified.

Section 3.3 is concerned with the remainder of the formulation of
the distributed parameter state regulator problem -- namely, the
introduction of and justification for a meaningful quadratic cost criterion for
the systems described in Section 3.2.

Section 3.4 contains the restriction of the set of controls to a
finite dimensional space as described above -- a restriction to be called
the pointwise control problem.

3.2 THE SPACE OF CONTROLS

The general parabolic and hyperbolic systems to be considered
are given by Eqs. 2.4.2 and 2.4.3 for the parabolic case and by
Eqs. 2.4.6 and 2.4.7 for the hyperbolic case. Moreover, we shall
restrict ourselves to the case of Dirichlet boundary conditions given by
Eq. 2.5.1. This will result in the consideration of the Hilbert space
$H_0^m(D)$, rather than $H^m(D)$, for the state space for the system as
described in Section 2.5. This is not a severe restriction and does not
fundamentally affect the generality of the results, since, as was
mentioned in Section 2.5, other types of boundary value problems can
be placed within a Hilbert space framework similar to that of $H_0^m(D)$
in the Dirichlet problem. The system operator A is assumed to be
either coercive or the infinitesimal generator of a semigroup of
operators as discussed in Chapter II.

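With these assumptions we enter into a discussion of the form
of the forcing function $f(\cdot)$ appearing in both Eqs. 2.4.2 and 2.4.6,
which we rewrite

$$\frac{dx(t)}{dt} = Ax(t) + f(t) \qquad (2.4.2)$$

$$\frac{d}{dt}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} = \mathcal{A}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} + \begin{bmatrix} 0 \\ f(t) \end{bmatrix} \qquad (2.4.6)$$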

Note that $f(\cdot)$ is required to lie in the function space $L^2(0,T;L^2(D))$.*
In order to put the forcing term in a form which appears more commonly
in system theoretic notation, let us introduce the control $u(t)$, where,
for each instant t, we require $u(t)$ to be an element of a Hilbert space
U, the control space. Moreover, let us assume that $u(\cdot) \in L^2(0,T;U)$.
To complete the description of the forcing term we define $B(t)$ as a
bounded linear operator defined, for each value of $t \in [0,T]$, on the
control space U with range in $L^2(D)$, or, in more convenient notation,
$B(t) \in \mathcal{L}(U;L^2(D))$ $\forall t \in [0,T]$. We now make the identification of the
forcing term $f(t)$ as

$$f(t) = B(t)\,u(t), \quad \forall t \in [0,T] \qquad (3.2.1)$$

The parabolic system now becomes:

$$\frac{dx(t)}{dt} = Ax(t) + B(t)\,u(t), \quad x(0) = x_0 \in H_0^m(D) \qquad (3.2.2)$$

* The exact requirement is that for each instant of time t, $f(t)$ must
be an element of $(H_0^m(D))' = H^{-m}(D)$. Since $L^2(D) \subset H^{-m}(D)$ there is
no great loss of generality and the attractiveness of using $L^2(D)$ is
overwhelming.

And if the "vector" operator $\mathcal{B}(t)$ is defined to be

$$\mathcal{B}(t) = \begin{bmatrix} 0 \\ B(t) \end{bmatrix}$$

then the hyperbolic system is represented by

$$\frac{d\underline{x}(t)}{dt} = \mathcal{A}\,\underline{x}(t) + \mathcal{B}(t)\,u(t); \quad \underline{x}(0) = \underline{x}_0, \text{ with each component of } \underline{x}_0 \text{ in } H_0^m(D) \qquad (3.2.3)$$

One further assumption must be made -- $B(t)u(t)$ is assumed to be
a strongly (in $L^2(0,T;L^2(D))$) continuously differentiable function of t.
This assumption will enable us to use the variation of constants formula
given in Section 2.8 (Eq. (2.8.1)).

With the control u defined and the manner in which u enters the
parabolic and hyperbolic systems clarified, we proceed to the
formulation of quadratic optimization problems for these systems in Section 3.3.

3.3 QUADRATIC CRITERIA FOR PARABOLIC AND
HYPERBOLIC SYSTEMS

In this section quadratic cost criteria weighting the state and the
control introduced in the previous section are presented for systems
(3.2.2) and (3.2.3). These criteria will be seen to be directly
analogous to finite dimensional quadratic cost criteria. The choice of
a quadratic cost criterion is sometimes motivated by practical
considerations. In many applied distributed parameter control problems
of interest it is not feasible to consider driving the system to a fixed
final state distribution, but rather one would have deviations from a
desired distribution be damped out by the control system. Thus, a
weighted sum of the deviation of the state from the desired distribution
and the magnitude of the control is the necessary type of criterion. The
particular choice of a quadratic cost criterion also stems from the
hindsight that what yielded such elegant results as linear feedback
control laws in the lumped optimal control theory should yield at least some
fraction of the same in distributed optimal control theory.

As a preliminary to the specification of a quadratic cost criterion
for the problems under consideration we make the following definitions:

Definition 3.1: The bounded linear operator Q(t), defined for all
$t \in [0,T]$ on the Sobolev space $H_0^m(D)$ with range in $H_0^m(D)$, is called
the state weighting operator. Q(t) is assumed to be a self-adjoint
positive operator, that is, Q(t) has the properties:

1. $Q(t) = Q^*(t), \quad \forall t \in [0,T]$

2. $\langle x, Q(t)x \rangle_{H_0^m(D)} \geq 0, \quad \forall x \in H_0^m(D), \ \forall t \in [0,T]$

The form $\langle x, Q(t)x \rangle_{H_0^m(D)}$ corresponds, for each $t \in [0,T]$, to a
positive weighted average over the spatial domain D. This spatial
weighting is, of course, imbedded in the Hilbert space notation. It is
seen that Q(t) corresponds directly to the positive semidefinite state
weighting matrix Q(t) for the finite dimensional state regulator
problem treated by Athans and Falb (Ref. 24, Chapter 9). We make
the further definition:

Definition 3.2: The bounded linear operator R(t), defined for
all $t \in [0,T]$ on U with range in U, is called the control weighting
operator. R(t) is assumed to be a self-adjoint strictly positive operator, or

1. $R(t) = R^*(t), \quad \forall t \in [0,T]$

2. $\langle u, R(t)u \rangle_U \geq a\|u\|_U^2$, for some $a > 0$, $\forall u \in U$, $\forall t \in [0,T]$

Once again, this corresponds to the positive definite control weighting
matrix R(t) for the finite dimensional problem and the form
$\langle u, R(t)u \rangle_U$ is a weighted average over the spatial domain D. It is
sometimes desirable to add a penalization cost for deviation of the
state distribution at the final time T from the desired distribution.
For this case we have:

Definition 3.3: The bounded linear operator F, defined on
$H_0^m(D)$ with range in $H_0^m(D)$, is called the terminal state weighting
operator. F is assumed to be a self-adjoint positive operator, or

1. $F = F^*$

2. $\langle x, Fx \rangle_{H_0^m(D)} \geq 0, \quad \forall x \in H_0^m(D)$

Not surprisingly, the operator F corresponds to the terminal state
weighting matrix F in the finite dimensional regulator problem and,
of course, the form $\langle x, Fx \rangle_{H_0^m(D)}$ is a weighted average over the
spatial domain D.

If we denote the desired state distribution as $x_d(t) \in H_0^m(D)$, we may
now state the cost criterion for parabolic systems as:

$$J = \int_0^T \left[ \langle x(t)-x_d(t),\, Q(t)(x(t)-x_d(t)) \rangle_{H_0^m(D)} + \langle u(t),\, R(t)u(t) \rangle_U \right] dt + \langle x(T)-x_d(T),\, F(x(T)-x_d(T)) \rangle_{H_0^m(D)} \tag{3.3.1}$$

where x(t) is the solution of (3.2.2) with the control sequence
$u(t) \in U$, $t \in [0,T]$, specified. The optimal control problem may then be
defined:

Definition 3.4: The optimal control problem for the system (3.2.2)
is to determine the control u*(t), $t \in [0,T]$, with $u^*(t) \in U$ for all
$t \in [0,T]$ such that, if x*(t) is the solution of (3.2.2) with u(t) = u*(t),
the functional J in (3.3.1) is minimized. The minimizing control
u*(t), $t \in [0,T]$, is called the optimal control (if it exists).

As an example of an optimal control problem for a parabolic
system, let us consider the heat equation, given in Section 2.4, with
the control u(t) entering in a forcing term. Assuming Dirichlet boundary
conditions for this problem, we have:

$$\dot{x}(t) = Ax(t) + B(t)u(t); \quad x(0) = x_0 \in H_0^m(D)$$

where A is the operator defined on $H_0^m(D)$ corresponding to the
spatial differential operator $\partial^2/\partial z^2$. Let us choose $x_d(t) = 0$ and the
cost criterion to be such that we penalize mean square deviation of the
state trajectory from zero and total expended control energy, that is,
we choose a criterion of the form:

$$J = \int_0^T \left[ \int_D x^2(t,z)\,dz + r\int_D u^2(t,z)\,dz \right] dt, \quad r \in R^1,\ r > 0$$
This can be put within the framework of the optimal control problem
specified in Definition 3.4 if we choose Q(t), R(t), and F to be:

1. Q(t) is the identity operator on $H_0^m(D)$, which can be written
as the integral operator

$$Q(t)x(t) = I_{H_0^m(D)}\,x(t) = \int_D \delta(z-\zeta)\,x(t,\zeta)\,d\zeta, \quad \forall x(t) \in H_0^m(D)$$

where $\delta(z-\zeta)$ is the Dirac delta function.

2. R(t) is the identity operator on U, multiplied by the scalar
r, or

$$R(t)u(t) = r I_U\,u(t) = r\int_D \delta(z-\zeta)\,u(t,\zeta)\,d\zeta, \quad u(t) \in U$$

3. F is the zero operator.

With these choices of Q(t), R(t), and F the cost criterion of Eq. 3.3.1
is seen to be the desired cost criterion.
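The heat-equation criterion above can be evaluated numerically once the state and control are sampled on a grid. The following is a minimal sketch, assuming a hypothetical uniform grid on D = (0, 1) and a simple Riemann sum; the function name `quadratic_cost` and the sample profiles are illustrative assumptions and do not come from the text.

```python
def quadratic_cost(x, u, r, dz, dt):
    """Riemann-sum approximation of
    J = int_0^T [ int_D x^2 dz + r * int_D u^2 dz ] dt.

    x[n][j] and u[n][j] are samples of the state and control at time
    step n and grid point j (a hypothetical uniform grid).
    """
    total = 0.0
    for x_n, u_n in zip(x, u):
        state_term = sum(v * v for v in x_n) * dz          # int_D x^2 dz
        control_term = r * sum(v * v for v in u_n) * dz    # r * int_D u^2 dz
        total += (state_term + control_term) * dt
    return total


# Illustrative data: constant profiles x = 1, u = 2 on D = (0, 1), T = 1.
nz, nt = 100, 50
dz, dt = 1.0 / nz, 1.0 / nt
x = [[1.0] * nz for _ in range(nt)]
u = [[2.0] * nz for _ in range(nt)]
J = quadratic_cost(x, u, r=0.5, dz=dz, dt=dt)
```

For these constant profiles the integrals can be done by hand, giving J = 1 + 0.5 · 4 = 3, which the discrete sum reproduces up to rounding.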

The preceding discussion must be modified somewhat to achieve
the definition of the control problem for hyperbolic systems. As a
preliminary to this modification, let us consider a general 2×2
matrix operator $\mathcal{N}$ whose elements $N_{ij}$ are bounded linear operators
on a Hilbert space H. $\mathcal{N}$ operates on the two dimensional vector
$\mathbf{x}$, the components of which are elements of H. It is useful to define
the inner product $\langle \mathbf{x}, \mathcal{N}\mathbf{x} \rangle = \mathbf{x}'\mathcal{N}\mathbf{x}$ as

$$\mathbf{x}'\mathcal{N}\mathbf{x} = \sum_{i=1}^{2}\sum_{j=1}^{2} \langle x_i, N_{ij}x_j \rangle_H \tag{3.3.2}$$

We are now in a position to make the modifications of Definitions 3.1,
3.2, and 3.3 to fit the hyperbolic case, beginning with the definition of
the state weighting matrix operator:
Definition 3.5: The 2×2 matrix $\mathcal{Q}(t)$, the elements of which,
$Q_{ij}(t)$, are bounded linear operators defined, for all $t \in [0,T]$, on
$H_0^m(D)$ with range in $H_0^m(D)$, is called the state weighting matrix operator.
$\mathcal{Q}(t)$ is assumed to be a symmetric positive semidefinite matrix
with self-adjoint elements, or

1. $Q_{ij}(t) = Q_{ji}(t), \quad \forall t \in [0,T], \quad i,j = 1,2$

2. $\mathbf{x}'\mathcal{Q}(t)\mathbf{x} = \sum_{i=1}^{2}\sum_{j=1}^{2} \langle x_i, Q_{ij}(t)x_j \rangle_{H_0^m(D)} \geq 0, \quad \forall x_i \in H_0^m(D), \ i = 1,2, \ \forall t \in [0,T]$

3. $Q_{ij}(t) = Q_{ij}^*(t), \quad \forall t \in [0,T]$

There is no need to modify Definition 3.2 for the control weighting operator,
since the control space is the same for both parabolic and hyperbolic
systems. However, the terminal state weighting operator of
Definition 3.3 must be modified as follows:

Definition 3.6: The 2×2 matrix $\mathcal{F}$, the elements of which,
$F_{ij}$, are bounded linear operators defined on $H_0^m(D)$ with range in
$H_0^m(D)$, is called the terminal state weighting matrix operator. $\mathcal{F}$ is
assumed to be symmetric positive semidefinite with self-adjoint elements, or

1. $F_{ij} = F_{ji}, \quad i,j = 1,2$

2. $\mathbf{x}'\mathcal{F}\mathbf{x} = \sum_{i=1}^{2}\sum_{j=1}^{2} \langle x_i, F_{ij}x_j \rangle_{H_0^m(D)} \geq 0$

3. $F_{ij} = F_{ij}^*, \quad i,j = 1,2$

If we now denote the desired state vector as $\mathbf{x}_d(t)$, the cost criterion
for hyperbolic systems is given by:

$$J = \int_0^T \left[ (\mathbf{x}(t)-\mathbf{x}_d(t))'\,\mathcal{Q}(t)\,(\mathbf{x}(t)-\mathbf{x}_d(t)) + \langle u(t), R(t)u(t) \rangle_U \right] dt + (\mathbf{x}(T)-\mathbf{x}_d(T))'\,\mathcal{F}\,(\mathbf{x}(T)-\mathbf{x}_d(T)) \tag{3.3.3}$$

where $\mathbf{x}(t)$ is the solution of (3.2.3) with the control sequence
$u(t) \in U$, $t \in [0,T]$, specified. Just as in the case for parabolic systems,
the optimal control problem is similarly defined for hyperbolic systems.

Definition 3.7: The optimal control problem for the system
(3.2.3) is to determine the control u*(t), $t \in [0,T]$, such that, if $\mathbf{x}^*(t)$
is the solution of (3.2.3) with u(t) = u*(t), the functional J in (3.3.3)
is minimized. The minimizing control u*(t), $t \in [0,T]$, is called the
optimal control (if it exists).

As an example of an optimal control problem for a hyperbolic
system, we consider the forced equation for longitudinal vibrations in
a rigid beam, the unforced version of which is given in Section 2.4.
Assuming Dirichlet boundary conditions, the equation may be written
in the form of Eq. 3.2.3, namely:

$$\frac{d\mathbf{x}(t)}{dt} = \frac{d}{dt}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} = \begin{bmatrix} 0 & I \\ A & 0 \end{bmatrix}\begin{bmatrix} x_1(t) \\ x_2(t) \end{bmatrix} + \begin{bmatrix} 0 \\ B(t) \end{bmatrix} u(t); \quad \mathbf{x}(0) = \begin{bmatrix} x_{01} \\ x_{02} \end{bmatrix}, \quad x_{01}, x_{02} \in H_0^m(D)$$

where A is the operator on $H_0^m(D)$ corresponding to the differential
operator $\partial^2/\partial z^2$. In this example let us choose $\mathbf{x}_d(t) = \mathbf{0}$ and have the
cost criterion penalize both the mean square deviation of the trajectories
$x_1(t)$ and $x_2(t)$ from zero as well as the expended control
energy, or
$$J = \int_0^T \left[ \int_D \left( x_1^2(t,z) + r_1 x_2^2(t,z) \right) dz + r_2 \int_D u^2(t,z)\,dz \right] dt$$

with $r_1$ and $r_2$ both positive real numbers. It is seen that the choice of

$$\mathcal{Q}(t) = \begin{bmatrix} I_{H_0^m(D)} & 0 \\ 0 & r_1 I_{H_0^m(D)} \end{bmatrix}$$

and

$$R(t) = r_2 I_U$$

where $I_{H_0^m(D)}$ and $I_U$ are the identity operators specified in the example
following Definition 3.4, puts the cost criterion of Eq. 3.3.3 in
the above desired form.

The parabolic control problem defined in Definition 3.4 will be
studied in great detail in Chapter IV, whereas the hyperbolic control
problem of Definition 3.7 will be briefly discussed in Chapter VI. Important
special cases of these problems are discussed in the next section of
this chapter.


3.4 THE POINTWISE CONTROL PROBLEM

The optimal control problems defined in the preceding section
will be specialized in this section to consider the case where the control
does not enter into the system in a distributed fashion, but rather control
energy enters the system at a fixed number of "points" within the
spatial domain of the system. The justification of the use of this type
of pointwise control is on physical grounds. For many physical distributed
parameter systems it is next to impossible to drive the system by
application of a control distribution. For instance, in the rigid beam
considered in the preceding section, the control energy would enter
much more realistically as forces at various discrete points along the
length of the beam, rather than a "perfect" distribution of force defined
at every point of the beam. Another example is the membrane
of a drum. Here the distributed displacement of the tympanic membrane
is achieved through the approximately pointwise control of the
impulsively applied beating of the drumsticks. In both of these cases
the analysis would become terribly complicated if the control were
modeled by a distribution on the spatial domain. Since it is more
likely that one would approximate the distributed control in many physical
systems by a finite number of lumped controls, this would motivate
the a priori use of non-distributed controls and the subsequent
optimization problem in terms of these controls. Moreover, it seems
likely that the analytic specification of an optimal control distribution
would be much more difficult than the specification of an optimal
control vector. In essence, the pointwise control problem is a hybrid
of pure distributed parameter control and finite dimensional control.

If we suppose that control is applied at the k points $z_i$, $i = 1, 2, \ldots, k$,
the control space U to be considered is k-dimensional Euclidean space
$R^k$, or, in other words, the control defined in Section 2 is assumed to be
a k-vector u. On first thought, it would be desirable mathematically
to have the forcing term of Eqs. 3.2.2 and 3.2.3 be of the form

$$B(t)u(t) = \sum_{i=1}^{k} \delta(z-z_i)\,b_i(t)\,u_i(t) \tag{3.4.1}$$

where $\delta(z-z_i)$ is the Dirac δ-function defined on the spatial domain D,
and $b_i(t)$, $i = 1, 2, \ldots, k$, are bounded continuous functions of time.
Equation 3.4.1 reflects "true" pointwise control, that is, finite control
energy really enters at the set of control points $\{z_i\}_{i=1}^{k}$. Unfortunately,
expressions of the form of the right hand side of Eq. 3.4.1 cannot be
elements of $L^2(D)$ for each $t \in [0,T]$, because the Dirac δ-function is
not square integrable. Since it is required in Section 2 that the forcing
term be an element of $L^2(D)$ for all $t \in [0,T]$, we must abandon hope
of using "true" pointwise control.

The next logical step is to assume that control action takes place 
over a small volume surrounding each of the control points $z_i$. This
actually gives a more accurate picture of the procedure of applying 
pointwise control over a spatial domain, since it is a mathematical 
fiction to consider control applied at a single point. The physical justi- 
fication of this assumption can be seen by considering the examples 
given above. In the rigid beam, any device which applies force at a 
"point" of the beam cannot apply this force over a region of the beam of 
zero width. There must be some small length of the beam over which 
the force is actually applied. In the case of the drum, the vibration of 
the membrane is not caused by excitation of a point of the membrane 
with zero area, but by excitation of a small area corresponding to the 
area of the tip of the drumstick. Both of these cases represent a valid 
approximation to the pointwise control problem, since the "volumes" 
surrounding the control points are sufficiently small compared to the 
"volume" of the spatial region D. 

This pointwise control approximation is achieved through the introduction
of the following B operator:

Definition 3.8: The pointwise control operator $B_0(t)$, defined
for all $t \in [0,T]$ on $R^k$, is described by

$$B_0(t)u(t) = \sum_{i=1}^{k} \chi_i(z)\,b_i(t)\,u_i(t), \quad \forall u(t) \in R^k \tag{3.4.2}$$

where $\chi_i(z)$ is the characteristic function of the set $E_i \subset D$ which includes
the control point $z_i$ as described above. This characteristic
function is given by

$$\chi_i(z) = \begin{cases} 1 & \text{if } z \in E_i \\ 0 & \text{if } z \notin E_i \end{cases}$$

The functions $b_i(t)$ are assumed to be bounded on [0,T]. Note that,
according to the assumption in Section 2 of this chapter, $u(\cdot) \in L^2(0,T;R^k)$.

In order to show that the form $B_0(t)u(t)$ is an element of
$L^2(0,T;L^2(D))$, and, therefore, satisfies the required condition to be a
forcing term for Eqs. 3.2.2 and 3.2.3, we prove the following lemma:

Lemma 3.1: For each $t \in [0,T]$, $B_0(t)$ is a bounded linear operator
with domain $R^k$ and range in $L^2(D)$. Moreover, the function
$f(\cdot)$, where $f(t) = B_0(t)u(t)$, $\forall t \in [0,T]$, is an element of $L^2(0,T;L^2(D))$.


Proof:

$$\int_D [B_0(t)u(t)]^2\,dz = \int_D \Big( \sum_{i=1}^{k} \chi_i(z)\,b_i(t)\,u_i(t) \Big)^2 dz = \sum_{i=1}^{k} \int_D \chi_i^2(z)\,b_i^2(t)\,u_i^2(t)\,dz = \sum_{i=1}^{k} b_i^2(t)\,u_i^2(t) \int_D \chi_i^2(z)\,dz$$

where the cross terms vanish provided the sets $E_i$ are mutually disjoint. Since

$$\int_D \chi_i^2(z)\,dz = \int_D \chi_i(z)\,dz = \mu(E_i),$$

the Lebesgue measure of the set $E_i$, and since this must be less than the Lebesgue measure of the domain D, we have

$$\int_D [B_0(t)u(t)]^2\,dz \leq \mu(D) \sum_{i=1}^{k} b_i^2(t)\,u_i^2(t) = \mu(D)\,\|B(t)u(t)\|_{R^k}^2$$

where B(t) is the k×k diagonal matrix with $B_{ii}(t) = b_i(t)$, $i = 1, 2, \ldots, k$.
If $\|B(t)\|_{R^k}$ is the induced matrix norm of B(t), it follows that $B_0(t)$
is a bounded linear operator from $R^k$ into $L^2(D)$ and

$$\|B_0(t)\|_{L^2(D)} \leq \mu^{1/2}(D)\,\|B(t)\|_{R^k}, \quad \text{for all } t \in [0,T] \tag{3.4.3}$$

To show that $f(\cdot)$ is an element of $L^2(0,T;L^2(D))$, we write

$$\|f(\cdot)\|_{L^2(0,T;L^2(D))}^2 = \int_0^T \|f(t)\|_{L^2(D)}^2\,dt = \int_0^T \|B_0(t)u(t)\|_{L^2(D)}^2\,dt$$

which, by Inequality 3.4.3, can be written

$$\|f(\cdot)\|_{L^2(0,T;L^2(D))}^2 \leq \mu(D) \int_0^T \|B(t)\|_{R^k}^2\,\|u(t)\|_{R^k}^2\,dt \leq \mu(D)\,\|B(\cdot)\|_{L^2(0,T;R^k)}^2\,\|u(\cdot)\|_{L^2(0,T;R^k)}^2$$

where the last inequality is obtained by the use of Schwarz' inequality.
By the assumed boundedness of the functions $b_i(t)$, $i = 1, \ldots, k$, and the
assumption that $u(\cdot) \in L^2(0,T;R^k)$, we obtain

$$\|f(\cdot)\|_{L^2(0,T;L^2(D))}^2 < \infty$$

implying that $f(\cdot) \in L^2(0,T;L^2(D))$.
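The first bound in the proof of Lemma 3.1 can be checked numerically at a fixed time t. The sketch below assumes a one-dimensional domain D = (0, 1), three disjoint control intervals standing in for the sets E_i, and arbitrary gain and control values; all of these numbers, and the helper name `pointwise_forcing_norm_sq`, are hypothetical choices made for illustration.

```python
def pointwise_forcing_norm_sq(intervals, b, u, n_grid=10000, domain=(0.0, 1.0)):
    """Midpoint-rule approximation of int_D [B_0 u](z)^2 dz, where
    [B_0 u](z) = sum_i chi_i(z) * b_i * u_i and chi_i is the
    characteristic function of the half-open interval intervals[i]."""
    lo, hi = domain
    dz = (hi - lo) / n_grid
    total = 0.0
    for j in range(n_grid):
        z = lo + (j + 0.5) * dz
        value = sum(b_i * u_i
                    for (a, c), b_i, u_i in zip(intervals, b, u)
                    if a <= z < c)
        total += value * value * dz
    return total


# Hypothetical data: disjoint sets E_i, gains b_i(t) and controls u_i(t) at one t.
intervals = [(0.10, 0.15), (0.50, 0.52), (0.80, 0.90)]
b = [1.0, 2.0, -0.5]
u = [3.0, -1.0, 4.0]

lhs = pointwise_forcing_norm_sq(intervals, b, u)
# Closed form from the proof: sum_i b_i^2 u_i^2 mu(E_i)
exact = sum((bi * ui) ** 2 * (c - a) for (a, c), bi, ui in zip(intervals, b, u))
# Upper bound from the proof with mu(D) = 1: mu(D) * sum_i b_i^2 u_i^2
bound = 1.0 * sum((bi * ui) ** 2 for bi, ui in zip(b, u))
```

With small sets E_i the bound μ(D) Σ b_i² u_i² is far from tight, which is consistent with the lemma only needing boundedness rather than a sharp constant.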

Since the pointwise operator $B_0(t)$ operating on controls u in the
control space $R^k$ qualifies as a forcing term for systems (3.2.2) and
(3.2.3), it remains to formulate the optimal control problem for this
case. Since the state space remains unchanged, neither the state weighting
operator Q(t) nor the terminal state weighting operator F need
be modified in the parabolic system case. The same holds true for their
counterparts $\mathcal{Q}(t)$ and $\mathcal{F}$ in the hyperbolic case. The control space
is the finite dimensional space $R^k$, so that the control weighting operator
is changed accordingly:

Definition 3.9: The k×k possibly time-varying matrix R(t),
defined for all $t \in [0,T]$ on $R^k$ with range in $R^k$, is called the pointwise
control weighting matrix. R(t) is assumed to be symmetric and
positive definite for all $t \in [0,T]$.

We are now in a position to specify the quadratic cost criterion
for parabolic systems with pointwise control as follows:

$$J = \int_0^T \left[ \langle x(t)-x_d(t),\, Q(t)(x(t)-x_d(t)) \rangle_{H_0^m(D)} + u'(t)R(t)u(t) \right] dt + \langle x(T)-x_d(T),\, F(x(T)-x_d(T)) \rangle_{H_0^m(D)} \tag{3.4.4}$$

where x(t) is the solution of (3.2.2) with $B(t) = B_0(t)$ and the control
sequence $u(t) \in R^k$, $t \in [0,T]$, specified. The pointwise optimal control
problem for parabolic systems may now be stated as:

Definition 3.10: The optimal control problem for the system
(3.2.2) with $B(t) = B_0(t)$ and $U = R^k$ is to determine the control
u*(t), $t \in [0,T]$, with $u^*(t) \in R^k$ for all $t \in [0,T]$, such that, if x*(t) is
the solution of (3.2.2) with $B(t) = B_0(t)$ and u(t) = u*(t), the functional J
in (3.4.4) is minimized. The minimizing u*(t), $t \in [0,T]$, is called the
pointwise optimal control (if it exists).

The discussion of pointwise controls will be deferred until Chapter V,
where the optimal pointwise control problem for parabolic systems will
be solved. The pointwise control problem for hyperbolic systems has
not been introduced because study of this problem will not
yield any more insight into the nature of pointwise control than is obtained
through the study of pointwise controls for parabolic systems alone.



CHAPTER IV


OPTIMAL CONTROL OF PARABOLIC SYSTEMS

4.1 INTRODUCTION

The purpose of this chapter is to solve the optimal control prob- 
lem for parabolic systems as specified in Definition 3.4 of the pre- 
ceding chapter. The first concern of this chapter will be to show that 
unique solutions of the optimal control problem exist in both the case 
where the system operator is coercive and the case where the sys- 
tem operator is the infinitesimal generator of a semigroup of oper- 
ators. Next, necessary conditions for optimality will be discussed and 
a feedback solution for the optimal control will be derived for both 
types of system operators. In this chapter we shall also treat the solution
of the parabolic optimal control problem defined on an infinite
time interval, and we shall derive an integro-differential equation, the
solution of which specifies the form of the optimal feedback control law.

In Section 4.2 the existence and uniqueness of solutions of the
optimal control problems for both types of system operators is demonstrated.
With existence and uniqueness guaranteed, we derive, in
Section 4.3, the necessary conditions for optimality in the coercive
system operator case and show, in Section 4.4, that these necessary
conditions imply the existence of a feedback form in which the feedback
operator is seen to satisfy a nonlinear operator equation of the
Riccati type. The minimum value of the cost criterion will also be
shown to be directly expressible in terms of this feedback operator.

Section 4.5 is concerned with the proof that if a bounded solution
of the operator equation discussed above exists in the case where the
system operator is the infinitesimal generator of a semigroup of
operators then the feedback form derived for the optimal control in
the coercive case is also optimal in this case. This leads naturally to
the proof in Section 4.6 that a bounded solution of the Riccati operator
equation does indeed exist.

Section 4.7 contains the discussion of the parabolic control problem
defined on the infinite time interval $(0,\infty)$.

In Section 4.8 it is shown that the Riccati operator equation is
equivalent to a nonlinear partial integro-differential equation.

4.2 EXISTENCE AND UNIQUENESS OF SOLUTIONS 

In this section we examine the question of existence and uniqueness
of solutions of the optimal control problem stated in Definition 3.4.
We shall show that for an elliptic operator, either coercive or strongly
elliptic, the optimal control problem for parabolic systems, as discussed
in Section 3 of the preceding chapter, has a unique solution.
Lions (Ref. 15) provides the machinery for demonstrating this, by giving a
general existence and uniqueness theorem for controls minimizing a
certain cost functional. This is then shown to cover existence and
uniqueness of optimal controls in the parabolic control problem. Lions
does not consider terminal-time cost in his cost criterion, so that any
modification of the results due to the slightly more general inclusion of
terminal-time cost will be indicated.

As a preliminary to the discussion of existence and uniqueness of 
optimal controls let us make the following definitions: 





Definition 4.1: A coercive bilinear form $\Pi(u,v)$ is a mapping
of U × U into the reals for which there exists a c > 0 such that

$$\Pi(u,u) \geq c\,\|u\|^2, \quad \forall u \in U$$

Definition 4.2: The bilinear form $\Pi(u,v)$ is said to be symmetric if

$$\Pi(u,v) = \Pi(v,u), \quad \forall u,v \in U$$

Definition 4.3: The bilinear form $\Pi(u,v)$ is said to be continuous
if it is a continuous function of each of its arguments.

Now, having introduced the bilinear form $\Pi(u,v)$, let us consider
the cost functional

$$J(u) = \Pi(u,u) - 2L(u), \quad u \in U \tag{4.2.1}$$

where L is a bounded linear functional defined on U. The existence
and uniqueness of a control u* which minimizes J in (4.2.1) is provided
by the following theorem.

Theorem 4.1: If $\Pi(u,v)$ is a continuous, symmetric, coercive
bilinear form, then there exists a unique $u^* \in U$ such that

$$J(u^*) = \inf_{u \in U} J(u)$$

Existence is proved by defining a sequence approaching the infimum,
showing it is bounded, and extracting a subsequence which has a weak
limit in U. Since $\Pi(v,v)$ is lower semicontinuous and L(v) is continuous
in the weak topology of U it is seen that the weak limit in U
is the minimizing element u*. Uniqueness follows directly from the
strict convexity of the function $\Pi(v,v)$. Details have been omitted, but
are readily available in Ref. 15.

In order to proceed to the discussion of existence and uniqueness 
of the solution to the parabolic optimal control problem, the question 
of existence and uniqueness of solutions of the parabolic equation (3.2.2) 
must be considered. This existence and uniqueness question for the 
case of coercive elliptic system operators is best answered through the 
use of another result of Lions' which will also be used to obtain neces- 
sary conditions for optimality in the following section. 

Theorem 4.2: If $\Pi(u,v)$ satisfies the hypotheses of Theorem 4.1,
then J(u) has a minimum value J(u*) if and only if u* satisfies the
equation*

$$\Pi(u^*,v) = L(v), \quad \forall v \in U \tag{4.2.2}$$

The proof of this theorem is due to Lions; since it is essential to
the optimization results of Section 3 of this chapter, it is presented in
Appendix C for the sake of completeness.
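A finite dimensional analogue may help fix ideas. The sketch below assumes U = R², a bilinear form Π(u,v) = u'Pv with a hypothetical symmetric positive definite matrix P, and a linear functional L(v) = l'v; the matrix, the vector and the function names are illustrative assumptions only. In that setting the condition Π(u*,v) = L(v) for all v reduces to the linear system Pu* = l.

```python
def bilinear(P, u, v):
    """Pi(u, v) = u' P v for a 2x2 matrix P."""
    return sum(u[i] * P[i][j] * v[j] for i in range(2) for j in range(2))


def linear(l, v):
    """L(v) = l' v."""
    return sum(li * vi for li, vi in zip(l, v))


def cost(P, l, u):
    """J(u) = Pi(u, u) - 2 L(u)."""
    return bilinear(P, u, u) - 2.0 * linear(l, u)


def minimizer(P, l):
    """Solve P u = l by Cramer's rule (2x2 case only)."""
    det = P[0][0] * P[1][1] - P[0][1] * P[1][0]
    return [(l[0] * P[1][1] - P[0][1] * l[1]) / det,
            (P[0][0] * l[1] - P[1][0] * l[0]) / det]


# Hypothetical data: P symmetric and positive definite, so Pi is coercive.
P = [[2.0, 0.5], [0.5, 1.0]]
l = [1.0, -1.0]
u_star = minimizer(P, l)

# Pi(u*, v) = L(v) is checked on the basis vectors, which suffices by linearity.
residuals = [bilinear(P, u_star, v) - linear(l, v) for v in ([1.0, 0.0], [0.0, 1.0])]
```

Any perturbation of `u_star` raises the cost by Π(w,w) > 0, where w is the perturbation, which is the finite dimensional form of the uniqueness argument.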

*If the control u* is required to lie in some convex constraint set
$\Omega \subset U$, the equation (4.2.2) which u* must satisfy becomes
$\Pi(u^*, v-u^*) \geq L(v-u^*)$, $\forall v \in \Omega$.

To show how this result yields the answer to the existence and
uniqueness question in parabolic equations, consider the bilinear form

$$\Pi(x,y) = -\langle Ax, y \rangle_{H_0^m(D)}; \quad x, y \in H_0^m(D) \tag{4.2.3}$$

where -A is assumed to be a coercive operator, satisfying the
Inequality 2.3.3. Hence, the bilinear form $\Pi(x,y)$ in (4.2.3) is coercive.
If we let the linear form L(v) in Eq. 4.2.1 be the inner product

$$L(y) = \langle Bu, y \rangle_{H_0^m(D)}, \quad \forall y \in H_0^m(D) \tag{4.2.4}$$

$H_0^m(D)$ is a Hilbert space and, by a well-known result in elementary
Hilbert space theory (see Ref. 25, p. 80), any bounded linear functional
on $H_0^m(D)$ is an inner product of y with some element in the
dual space of $H_0^m(D)$. Since $L^2(D)$ is contained in this dual space, and
since Bu is in $L^2(D)$ by our assumption in Section 3.2, then Expression
4.2.4 is a valid linear form on $H_0^m(D)$.

The hypotheses of Theorem 4.2 are thus satisfied and, therefore,
we have the result that there exists a unique $x \in H_0^m(D)$ such that

$$-\langle Ax, y \rangle_{H_0^m(D)} = \langle Bu, y \rangle_{H_0^m(D)}, \quad \forall y \in H_0^m(D) \tag{4.2.5}$$

For Eq. 4.2.5 to hold for all $y \in H_0^m(D)$ it must be true that there is a
unique $x \in H_0^m(D)$ satisfying the equation

$$Ax + Bu = 0 \tag{4.2.6}$$

Thus, we have demonstrated existence and uniqueness of a
solution to Eq. 4.2.6. Needless to say, we have not proved existence
and uniqueness of solutions of the parabolic equation (3.2.2), i.e.,
$\dot{x} = Ax + Bu$. However, Lions uses the procedure demonstrated above,
with a few analytic embellishments to account for time evolution, to
prove that there exists a unique solution of the parabolic equation

$$\dot{x}(t) = Ax(t) + B(t)u(t), \quad x(0) = x_0$$





Just as important, Lions shows that the mapping $u(\cdot) \to x(\cdot)$ from
$L^2(0,T;U)$ into $L^2(0,T;H_0^m(D))$ is continuous. The continuity of this
mapping plays a role in the application of Theorem 4.1 to the parabolic
control problem of Definition 3.4.

It remains to show that solutions of parabolic equations with
elliptic operators satisfying Condition 2.7.1 exist, are unique, and depend
continuously on the control, as was shown for coercive operators.
Proving existence and uniqueness is trivial in this case, since the
hypotheses of the theorems given in Appendices A and B are satisfied
and the solution of Eq. 3.2.2 is uniquely given by

$$x(t) = \Phi(t)x_0 + \int_0^t \Phi(t-\sigma)B(\sigma)u(\sigma)\,d\sigma, \quad x_0 \in \mathcal{D}(A) \tag{4.2.7}$$

where $\{\Phi(t)\}_{t \geq 0}$ is the semigroup of operators with infinitesimal
generator A as defined in Section 2.7. Since we wish in addition to
show that the solution depends continuously on the control we state and
prove the following theorem:

Theorem 4.3: If x(t) is the solution of the parabolic equation
(3.2.2) given by Eq. 4.2.7, then the mapping $u(\cdot) \to x(\cdot)$ of $L^2(0,T;U)$
into $L^2(0,T;H_0^m(D))$ is continuous.

Proof: Suppose $u_1(\cdot)$ and $u_2(\cdot)$, defined for all $t \in [0,T]$, are
elements of $L^2(0,T;U)$ and $x_1(\cdot)$ and $x_2(\cdot)$, defined for
all $t \in [0,T]$, are elements of $L^2(0,T;H_0^m(D))$ given by:

$$x_i(t) = \Phi(t)x_0 + \int_0^t \Phi(t-\sigma)B(\sigma)u_i(\sigma)\,d\sigma, \quad t \in [0,T], \ i = 1,2 \tag{4.2.8}$$
Forming the difference $x_1(t)-x_2(t)$ and taking the norm
squared on $L^2(0,T;H_0^m(D))$, we deduce that

$$\|x_1(\cdot)-x_2(\cdot)\|_{L^2(0,T;H_0^m(D))}^2 = \int_0^T \|x_1(t)-x_2(t)\|_{H_0^m(D)}^2\,dt = \int_0^T \Big\| \int_0^t \Phi(t-\sigma)B(\sigma)[u_1(\sigma)-u_2(\sigma)]\,d\sigma \Big\|_{H_0^m(D)}^2 dt$$

$$\leq \int_0^T \Big[ \int_0^t \|\Phi(t-\sigma)B(\sigma)[u_1(\sigma)-u_2(\sigma)]\|_{H_0^m(D)}\,d\sigma \Big]^2 dt \tag{4.2.9}$$

where the inequality follows from a generalization of the
triangle inequality for normed spaces. Since $\Phi(t-\sigma)$ and
$B(\sigma)$ are bounded linear operators we may write the inequality

$$\|\Phi(t-\sigma)B(\sigma)[u_1(\sigma)-u_2(\sigma)]\|_{H_0^m(D)} \leq \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}\,\|u_1(\sigma)-u_2(\sigma)\|_U \tag{4.2.10}$$

so that Inequality 4.2.9 can be written

$$\|x_1(\cdot)-x_2(\cdot)\|_{L^2(0,T;H_0^m(D))}^2 \leq \int_0^T \Big[ \int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}\,\|u_1(\sigma)-u_2(\sigma)\|_U\,d\sigma \Big]^2 dt \tag{4.2.11}$$
It should be noted that since $\|u_1(\sigma)-u_2(\sigma)\|_U$ is an element of
$L^2(0,t)$ for all $t \in [0,T]$, then the inner integral is, in effect,
an integral operator on $L^2(0,t)$ with kernel
$k(t,\sigma) = \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}$. If this kernel is square-summable,

$$\int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}^2\,d\sigma < \infty$$

for all $t \in [0,T]$, then by application of Schwarz' inequality
it can be shown that (see Ref. 22, p. 148)

$$\Big[ \int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}\,\|u_1(\sigma)-u_2(\sigma)\|_U\,d\sigma \Big]^2 \leq \int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}^2\,d\sigma \int_0^t \|u_1(\sigma)-u_2(\sigma)\|_U^2\,d\sigma \tag{4.2.12}$$

$$\leq \int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}^2\,d\sigma\ \|u_1(\cdot)-u_2(\cdot)\|_{L^2(0,T;U)}^2$$

holds for all $t \in [0,T]$. Now, by the uniform boundedness
principle, which is stated in Appendix D, $\Phi(t)$ is uniformly
bounded over [0,T]. Let us denote this bound by
$\|\Phi(t)\| \leq M$. Moreover, $B(\sigma)$ is uniformly bounded on
[0,T], with $\|B(\sigma)\| \leq b$. Thus, it follows that

$$\int_0^t \|\Phi(t-\sigma)B(\sigma)\|_{H_0^m(D)}^2\,d\sigma \leq M^2 b^2 t \tag{4.2.13}$$
is finite for all $t \in [0,T]$, and the substitution of Inequalities
4.2.12 and 4.2.13 into Inequality 4.2.11 yields the result that

$$\|x_1(\cdot)-x_2(\cdot)\|_{L^2(0,T;H_0^m(D))}^2 \leq \int_0^T M^2 b^2 t\,\|u_1(\cdot)-u_2(\cdot)\|_{L^2(0,T;U)}^2\,dt$$

or, equivalently, that

$$\|x_1(\cdot)-x_2(\cdot)\|_{L^2(0,T;H_0^m(D))}^2 \leq \frac{M^2 b^2 T^2}{2}\,\|u_1(\cdot)-u_2(\cdot)\|_{L^2(0,T;U)}^2$$

which implies that the mapping $u(\cdot)$ into $x(\cdot)$ from
$L^2(0,T;U)$ into $L^2(0,T;H_0^m(D))$ is continuous.
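The continuity estimate of Theorem 4.3 can be illustrated on the simplest possible case. The sketch below assumes a scalar state, the semigroup Φ(t) = exp(-λt) with λ ≥ 0 (so that M = 1), a constant scalar B = b, and a hypothetical control difference w = u_1 - u_2; the numerical values and the helper name `state_difference_norm_sq` are illustrative assumptions, and the integrals are approximated by the midpoint rule.

```python
import math


def state_difference_norm_sq(w, lam, b, T):
    """Approximate ||x1 - x2||^2 in L^2(0, T) for the scalar semigroup
    exp(-lam * t), where w[k] samples u1 - u2 at the midpoint of cell k."""
    n = len(w)
    dt = T / n
    total = 0.0
    for i in range(1, n + 1):
        t_i = i * dt
        # x1(t_i) - x2(t_i) = int_0^{t_i} exp(-lam (t_i - s)) b w(s) ds
        diff = sum(math.exp(-lam * (t_i - (k + 0.5) * dt)) * b * w[k] * dt
                   for k in range(i))
        total += diff * diff * dt
    return total


# Hypothetical data for the check.
T, lam, b, n = 1.0, 1.0, 2.0, 200
dt = T / n
w = [math.sin(2.0 * math.pi * (k + 0.5) * dt) for k in range(n)]

lhs = state_difference_norm_sq(w, lam, b, T)
w_norm_sq = sum(v * v for v in w) * dt          # ||u1 - u2||^2 in L^2(0, T)
M = 1.0                                         # |exp(-lam t)| <= 1 for lam >= 0
bound = 0.5 * M ** 2 * b ** 2 * T ** 2 * w_norm_sq
```

For a decaying semigroup the left-hand side falls well below the bound, since the proof replaces the kernel by its supremum over the whole interval.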

It may also be shown that the solution at the final time T also de- 
pends continuously on the control. This is also necessary for the ap- 
plication of Theorem 4. 1 to the parabolic optimal control problem. 


Theorem 4.4: If x(T) is the solution, at the terminal time, of
the parabolic equation (3.2.2) given by Eq. 4.2.7, then the mapping
$u(\cdot) \to x(T)$ of $L^2(0,T;U)$ into $H_0^m(D)$ is continuous.

Proof: With $u_1(\cdot)$, $u_2(\cdot)$, $x_1(\cdot)$, and $x_2(\cdot)$ defined as in the
proof of Theorem 4.3, we have the following expression:

$$\|x_1(T)-x_2(T)\|_{H_0^m(D)}^2 = \Big\| \int_0^T \Phi(T-\sigma)B(\sigma)[u_1(\sigma)-u_2(\sigma)]\,d\sigma \Big\|_{H_0^m(D)}^2 \leq \Big[ \int_0^T \|\Phi(T-\sigma)B(\sigma)[u_1(\sigma)-u_2(\sigma)]\|_{H_0^m(D)}\,d\sigma \Big]^2$$

$$\leq \Big[ \int_0^T \|\Phi(T-\sigma)B(\sigma)\|_{H_0^m(D)}\,\|u_1(\sigma)-u_2(\sigma)\|_U\,d\sigma \Big]^2$$

and, proceeding as in the proof of Theorem 4.3, we use the
square summability of the kernel $\|\Phi(T-\sigma)B(\sigma)\|_{H_0^m(D)}$ to
deduce that

$$\|x_1(T)-x_2(T)\|_{H_0^m(D)} \leq M b\,T^{1/2}\,\|u_1(\cdot)-u_2(\cdot)\|_{L^2(0,T;U)}$$

which implies that the transformation $u(\cdot)$ into x(T) from
$L^2(0,T;U)$ into $H_0^m(D)$ is continuous.


To summarize what has been done so far in this section: unique
solutions have been shown to exist for the parabolic equation (3.2.2) with
either coercive operators or elliptic infinitesimal generators of semigroups
as system operator. In addition, these solutions have been shown
to depend continuously on the control u. With this as a foundation, we
may use Theorem 4.1 to extend Lions' results to include the case of
terminal cost in the following manner:


Theorem 4.5: The optimal control problem for parabolic
systems as specified in Definition 3.4 has a unique solution
$u^*(\cdot) \in L^2(0,T;U)$.

Proof: First, let us introduce the notation $x^u(t)$ to denote the
solution of parabolic equation (3.2.2) on [0,T] with the
control $u(\cdot) \in L^2(0,T;U)$. Likewise $x^v(t)$ denotes the
solution corresponding to $v(\cdot) \in L^2(0,T;U)$. The cost criterion,
Eq. 3.3.1, for the parabolic control problem can be
written in the form

$$J(u) = \langle x^u - x_d,\, Q(x^u - x_d) \rangle_{L^2(0,T;H_0^m(D))} + \langle u, Ru \rangle_{L^2(0,T;U)} + \langle x^u(T)-x_d(T),\, F(x^u(T)-x_d(T)) \rangle_{H_0^m(D)} \tag{4.2.15}$$

which, in turn, can be rewritten as

$$J(u) = \Pi(u,u) - 2L(u) + \langle x_d, Qx_d \rangle_{L^2(0,T;H_0^m(D))} + \langle x_d(T), Fx_d(T) \rangle_{H_0^m(D)} \tag{4.2.16}$$

where we define the bilinear form $\Pi(u,v)$ and the linear form L(v) to be

$$\Pi(u,v) = \langle x^u, Qx^v \rangle_{L^2(0,T;H_0^m(D))} + \langle x^u(T), Fx^v(T) \rangle_{H_0^m(D)} + \langle u, Rv \rangle_{L^2(0,T;U)}$$

$$L(v) = \langle Qx_d, x^v \rangle_{L^2(0,T;H_0^m(D))} + \langle Fx_d(T), x^v(T) \rangle_{H_0^m(D)}$$

Since the last two terms of Eq. 4.2.16 are independent of u,
minimizing the cost functional J'(u)

$$J'(u) = \Pi(u,u) - 2L(u)$$

is equivalent to minimizing the original cost functional J(u).
We now note that II (u, v) is symmetric since 
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H(v,u) = <x V , Qx U > 

l 2 (0,t ; h-(d)) 


+ <x V (T) j Fx T 1 (T)> + <v,Ru> 

H™(D) 17 (0 , T; U) 


= <Q*x V ,x U > 


L^O.TjH^D)) 


+ <F*x V ( T) , x u ( T) > 


H m (D) 
o 


+ <R*v,u> ? = n(u,v) 

L (0, T ; U) 

by the self-adjointness of the operators Q(t), R(t) and F 

2 

and by the symmetry of the inner products on L (0, T;H q (D)), 
L 2 {0 ,T;U), and H^(D) . 

Next we note that $\Pi(u,v)$ is a coercive bilinear form because

$$\Pi(u,u) \ge \alpha \|u\|^2_{L^2(0,T;U)}, \qquad \alpha > 0, \quad \forall u(\cdot) \in L^2(0,T;U)$$

by the positivity of the operators Q(t) and F and by the strict positivity of R(t).

Also, $\Pi(u,v)$ is continuous, since by Theorem 4.3 $x^u(\cdot)$ is continuous in $u(\cdot)$ on $L^2(0,T;H_0^m(D))$ and by Theorem 4.4 $x^u(T)$ is continuous in $u(\cdot)$ on $H_0^m(D)$.

The hypotheses of Theorem 4.1 are thus satisfied, so that there exists a unique $u^*(\cdot) \in L^2(0,T;U)$ such that

$$J'(u^*) = \inf_{u(\cdot) \in L^2(0,T;U)} J'(u)$$

or, equivalently, there exists a unique solution of the parabolic optimal control problem given in Definition 3.4.

It should be noted that in the above proof continuity was discussed in terms of the strong topology of $L^2(0,T;H_0^m(D))$, whereas Lions requires continuity in the strong topology of W(0,T), which is defined in Chapter II, Section 4. This more stringent continuity requirement is not necessary, however, since the cost criterion involves only x(t) and not $\dot{x}(t)$, so that behavior of derivatives of solutions in terms of u is beside the point.
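
The structure of the existence argument can be illustrated in finite dimensions. The sketch below is only a toy analog, not part of the original development: the 2-by-2 matrix `P` and the vector `l` are hypothetical stand-ins for the symmetric, coercive bilinear form $\Pi(u,v) = u^T P v$ and the linear form $L(v) = l^T v$. In this setting the unique minimizer of $J'(u) = \Pi(u,u) - 2L(u)$ solves $Pu = l$, which is the finite-dimensional counterpart of the condition $\Pi(u^*,v) = L(v)$ for all v.

```python
def bilinear(P, u, v):
    """Pi(u, v) = u^T P v for a 2x2 matrix P (hypothetical stand-in)."""
    return sum(u[i] * P[i][j] * v[j] for i in range(2) for j in range(2))


def linear(l, v):
    """L(v) = l^T v."""
    return l[0] * v[0] + l[1] * v[1]


def cost(P, l, u):
    """J'(u) = Pi(u, u) - 2 L(u)."""
    return bilinear(P, u, u) - 2.0 * linear(l, u)


def minimizer(P, l):
    """Solve P u = l by Cramer's rule (P assumed symmetric positive definite)."""
    det = P[0][0] * P[1][1] - P[0][1] * P[1][0]
    return [(l[0] * P[1][1] - P[0][1] * l[1]) / det,
            (P[0][0] * l[1] - P[1][0] * l[0]) / det]


P = [[2.0, 0.5], [0.5, 1.0]]   # symmetric and coercive (positive definite)
l = [1.0, -1.0]
u_star = minimizer(P, l)
```

Perturbing `u_star` in any direction can only increase `cost`, which mirrors the uniqueness statement of the theorem.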

We have shown in this section that unique solutions exist to the 
parabolic control problem for both coercive system operators and 
elliptic system operators which are the infinitesimal generators of 
semigroups. It remains to characterize these optimal solutions for 
both types of system operators.

4.3 DERIVATION OF NECESSARY CONDITIONS -- COERCIVE CASE

Since, in the preceding section, it has been shown that a unique optimal control exists, we shall derive in this section the precise conditions that the optimal control must satisfy in the case of parabolic systems with coercive system operators. Again, this derivation is due formally to Lions (Ref. 15), but his results are extended to include the case of terminal-time cost in the cost criterion.

For convenience, let us rewrite Eq. 3.2.2

$$\frac{dx(t)}{dt} = A x(t) + B(t)u(t); \qquad x(0) = x_0 \tag{4.3.1}$$

Recall from the preceding section that the solution of the parabolic optimal control problem, namely, the optimal control $u^*$, must minimize the cost functional

$$J(u) = \Pi(u,u) - 2L(u) \tag{4.3.2}$$

where

$$\Pi(u,v) = \langle x^u, Q x^v \rangle_{L^2(0,T;H_0^m(D))} + \langle x^u(T), F x^v(T) \rangle_{H_0^m(D)} + \langle u, Rv \rangle_{L^2(0,T;U)}$$

$$L(v) = \langle Q x_d, x^v \rangle_{L^2(0,T;H_0^m(D))} + \langle F x_d(T), x^v(T) \rangle_{H_0^m(D)}$$


Now, by Theorem 4.2, the optimal control must satisfy

$$\Pi(u^*,v) = L(v) \quad \text{for all } v(\cdot) \in L^2(0,T;U)$$

or, equivalently,

$$\Pi(u^*, v-u^*) = L(v-u^*) \quad \forall v(\cdot) \in L^2(0,T;U) \tag{4.3.3}$$

Further, let us introduce the adjoint equation*

$$\frac{dp(t)}{dt} = -A^* p(t) - Q[x(t) - x_d(t)], \qquad p(T) = F[x(T) - x_d(T)] \tag{4.3.4}$$

p(t) is called the costate, and, by changing the time variable from t to T-t and realizing that $A^*$ is coercive if A is, the results of the preceding section tell us that a unique solution $p(\cdot) \in L^2(0,T;H_0^m(D))$** exists for Eq. 4.3.4. Let us denote the solution $p(\cdot)$ due to the application of control $u(\cdot) \in L^2(0,T;U)$ as $p^u$. Forming the inner product on $L^2(0,T;H_0^m(D))$ with $x^v - x^u$ we obtain

$$\left\langle \frac{dp^u}{dt},\, x^v - x^u \right\rangle_{L^2(0,T;H_0^m(D))} = -\langle A^* p^u,\, x^v - x^u \rangle_{L^2(0,T;H_0^m(D))} - \langle Q[x^u - x_d],\, x^v - x^u \rangle_{L^2(0,T;H_0^m(D))} \tag{4.3.5}$$

* The standard asterisk notation for adjoint operators is used here. This should not be confused with the equally standard use of the asterisk superscript to denote such optimal quantities as $u^*(t)$, $x^*(t)$, and $p^*(t)$.

** Actually, it must be true (and it can be shown) that $p(\cdot) \in W(0,T)$. This is necessary since we wish to take $L^2(0,T;H_0^m(D))$ inner products with $dp/dt$.

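Evaluating the left-hand side of Eq. 4.3.5,

$$\left\langle \frac{dp^u}{dt},\, x^v - x^u \right\rangle_{L^2(0,T;H_0^m(D))} = \int_0^T \left\langle \frac{dp^u(t)}{dt},\, x^v(t) - x^u(t) \right\rangle_{H_0^m(D)} dt$$

$$= \langle p^u(T),\, x^v(T) - x^u(T) \rangle_{H_0^m(D)} - \int_0^T \left\langle p^u(t),\, \frac{d}{dt}\big(x^v(t) - x^u(t)\big) \right\rangle_{H_0^m(D)} dt$$

where the boundary term at t = 0 vanishes because $x^v(0) = x^u(0) = x_0$. But since $p^u(T) = F[x^u(T) - x_d(T)]$ as seen in Eq. 4.3.4,

$$\left\langle \frac{dp^u}{dt},\, x^v - x^u \right\rangle_{L^2(0,T;H_0^m(D))} = \langle F[x^u(T) - x_d(T)],\, x^v(T) - x^u(T) \rangle_{H_0^m(D)} - \left\langle p^u,\, \frac{d}{dt}(x^v - x^u) \right\rangle_{L^2(0,T;H_0^m(D))} \tag{4.3.6}$$

The first term on the right-hand side of Eq. 4.3.5 can be written

$$\langle A^* p^u,\, x^v - x^u \rangle_{L^2(0,T;H_0^m(D))} = \langle p^u,\, A(x^v - x^u) \rangle_{L^2(0,T;H_0^m(D))} \tag{4.3.7}$$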

Combining (4.3.5), (4.3.6), and (4.3.7) and letting $u = u^*$ we obtain

$$-\langle Q[x^{u^*} - x_d],\, x^v - x^{u^*} \rangle_{L^2(0,T;H_0^m(D))} = -\left\langle p^{u^*},\, \left(\frac{d}{dt} - A\right)(x^v - x^{u^*}) \right\rangle_{L^2(0,T;H_0^m(D))} + \langle F[x^{u^*}(T) - x_d(T)],\, x^v(T) - x^{u^*}(T) \rangle_{H_0^m(D)} \tag{4.3.8}$$

$$= -\langle p^{u^*},\, B(v - u^*) \rangle_{L^2(0,T;H_0^m(D))} + \langle F[x^{u^*}(T) - x_d(T)],\, x^v(T) - x^{u^*}(T) \rangle_{H_0^m(D)}$$

Now from Eq. 4.3.3

$$\Pi(u^*, v-u^*) - L(v-u^*) = \langle Q[x^{u^*} - x_d],\, x^v - x^{u^*} \rangle_{L^2(0,T;H_0^m(D))} + \langle F[x^{u^*}(T) - x_d(T)],\, x^v(T) - x^{u^*}(T) \rangle_{H_0^m(D)} + \langle Ru^*,\, v - u^* \rangle_{L^2(0,T;U)} = 0 \tag{4.3.9}$$


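Combining Eqs. 4.3.8 and 4.3.9 we obtain

$$-\langle p^{u^*},\, B(v - u^*) \rangle_{L^2(0,T;H_0^m(D))} = \langle Ru^*,\, v - u^* \rangle_{L^2(0,T;U)} \quad \forall v(\cdot) \in L^2(0,T;U)$$

or, equivalently,

$$-\langle B^* p^{u^*},\, v - u^* \rangle_{L^2(0,T;U)} = \langle Ru^*,\, v - u^* \rangle_{L^2(0,T;U)} \quad \forall v(\cdot) \in L^2(0,T;U) \tag{4.3.10}$$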

Since equality must hold in Eq. 4.3.10 for all elements $v(\cdot) \in L^2(0,T;U)$, it must be true that

$$-B^*(t)\, p^{u^*}(t) = R(t)u^*(t) \tag{4.3.11}$$

is satisfied by the optimal control $u^*$. Moreover, since R(t) is assumed to be strictly positive in Definition 3.2 it has an inverse for all $t \in [0,T]$ and so Eq. 4.3.11 reduces to

$$u^*(t) = -R^{-1}(t) B^*(t)\, p^{u^*}(t) \tag{4.3.12}$$

It might reasonably be asked, at this point, why the above derivation does not hold as well for parabolic systems with system operators which are the infinitesimal generators of a semigroup of operators. The answer lies in the fact that for this class of parabolic systems the costate Eq. 4.3.4 cannot be shown to have a solution $p(\cdot) \in W(0,T)$, that is, although $p(\cdot)$ is an element of $L^2(0,T;H_0^m(D))$, we cannot show that $\frac{dp}{dt}(\cdot)$ is an element of $L^2(0,T;H_0^m(D))$. Thus, the inner product with $\frac{dp}{dt}(\cdot)$ in Eq. 4.3.5 would be meaningless in this case. This inability to express the necessary conditions for optimality in the form derived above for this class of systems will be circumvented in Section 5, however.

Let us summarize the results of this section:

If $u^*(\cdot) \in L^2(0,T;U)$ is the optimal control for the problem specified in Definition 3.4, then it is necessary that there exists a unique costate $p^*(\cdot)$ such that:

$$u^*(t) = -R^{-1}(t)B^*(t)p^*(t)$$

where $p^*(\cdot) \in L^2(0,T;H_0^m(D))$ satisfies the equation

$$\frac{dp^*(t)}{dt} = -A^* p^*(t) - Q[x^*(t) - x_d(t)]; \qquad p^*(T) = F[x^*(T) - x_d(T)]$$

and $x^*(\cdot) \in L^2(0,T;H_0^m(D))$ satisfies the equation

$$\frac{dx^*(t)}{dt} = A x^*(t) + B(t)u^*(t); \qquad x^*(0) = x_0$$
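
To make the two-point boundary structure of these conditions concrete, the following sketch treats a purely scalar stand-in (for example, a single retained mode), which is an assumption made here for illustration and not a construction from the text. The scalars `a`, `b`, `q`, `r`, `f_T` are hypothetical replacements for the operators A, B, Q, R, F. Because the canonical system is linear, the terminal mismatch $p(T) - F[x(T) - x_d(T)]$ is affine in the unknown initial costate p(0), so two trial integrations suffice to locate it.

```python
def rk4_step(f, t, y, dt):
    """One classical Runge-Kutta step for y' = f(t, y), y a list of floats."""
    k1 = f(t, y)
    k2 = f(t + dt / 2, [yi + dt / 2 * ki for yi, ki in zip(y, k1)])
    k3 = f(t + dt / 2, [yi + dt / 2 * ki for yi, ki in zip(y, k2)])
    k4 = f(t + dt, [yi + dt * ki for yi, ki in zip(y, k3)])
    return [yi + dt / 6 * (c1 + 2 * c2 + 2 * c3 + c4)
            for yi, c1, c2, c3, c4 in zip(y, k1, k2, k3, k4)]


def solve_by_shooting(a, b, q, r, f_T, x0, x_d, T, n=2000):
    """Find p(0) for the scalar canonical system by linear shooting."""
    def rhs(t, y):
        x, p = y
        u = -b * p / r                       # u* = -R^{-1} B* p
        return [a * x + b * u,               # state equation
                -a * p - q * (x - x_d(t))]   # adjoint (costate) equation

    def terminal_mismatch(p0):
        y, dt = [x0, p0], T / n
        for i in range(n):
            y = rk4_step(rhs, i * dt, y, dt)
        return y[1] - f_T * (y[0] - x_d(T))  # p(T) - F[x(T) - x_d(T)]

    m0, m1 = terminal_mismatch(0.0), terminal_mismatch(1.0)
    p0 = -m0 / (m1 - m0)                     # the mismatch is affine in p0
    return p0, terminal_mismatch(p0)


p0, mismatch = solve_by_shooting(a=-1.0, b=1.0, q=1.0, r=1.0, f_T=0.5,
                                 x0=1.0, x_d=lambda t: 0.0, T=1.0)
```

Shooting is used here only because it exposes the coupling between the forward state equation and the backward costate equation; for long horizons it becomes ill-conditioned, which is one practical motivation for decoupling the two equations.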

4.4 DECOUPLING AND THE RICCATI OPERATOR EQUATION 

In this section the necessary conditions derived in the preceding section are shown to yield a feedback form of the optimal control given by Eq. 4.3.12. The optimal feedback operator will be defined and will be shown to satisfy a nonlinear operator differential equation of the Riccati type. Bounded, positive and self-adjoint solutions to this equation will be shown to exist. Moreover, an optimal cost function will be defined and shown to be simply related to the optimal feedback operator. The results of this section are due to Lions (see Ref. 15, pp. 147-157) with slight modifications and an extension to include the terminal-time cost.

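Consider the system of equations:

$$\frac{dx^*(t)}{dt} = A x^*(t) - B(t)R^{-1}(t)B^*(t)p^*(t)$$

$$\frac{dp^*(t)}{dt} = -A^* p^*(t) - Q[x^*(t) - x_d(t)], \qquad t \in (s,T); \quad 0 < s < T \tag{4.4.1}$$

$$x^*(s) = h, \quad h \in H_0^m(D) \quad \text{and} \quad p^*(T) = F[x^*(T) - x_d(T)]$$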

This system admits a unique solution pair $(x^*(\cdot), p^*(\cdot)) \in W(s,T) \times W(s,T)$, where W(s,T) is the space W(0,T) defined in Section 2.4 with s taking the place of the lower limit 0. This fact is easily seen if the cost criterion of the preceding section is defined on the time interval (s,T) instead of [0,T] and the same straightforward procedure of deriving necessary conditions is used. Lions shows that the transformation $h \to \{x^*(\cdot), p^*(\cdot)\}$ is continuous from $H_0^m(D)$ into $W(s,T) \times W(s,T)$, and that the transformation $h \to p^*(s)$ is continuous from $H_0^m(D)$ into $H_0^m(D)$, this latter result following from the fact that $h \to p^*(s)$ is a composite transformation composed of $h \to \{x^*(\cdot), p^*(\cdot)\}$, $\{x^*(\cdot), p^*(\cdot)\} \to x^*(T)$, and the transformation which relates the "initial" value $p^*(T)$ to the solution $p^*(s)$ of the adjoint equation in (4.4.1), all of which transformations are continuous in their range spaces. The result of the continuity of the transformation $h \to p^*(s)$ is that $p^*(s)$ can be written in the form

$$p^*(s) = K(s)h + g(s) \tag{4.4.2}$$

where $K(s) \in \mathcal{L}(H_0^m(D); H_0^m(D))$ and $g(s) \in H_0^m(D)$ for all $s \in (0,T)$. Since s is any arbitrary time in (0,T) and h is the evaluation of $x^*(s)$, Eq. 4.4.2 tells us that

$$p^*(t) = K(t)x^*(t) + g(t) \quad \forall t \in [0,T] \tag{4.4.3}$$

where $x^*(\cdot)$ and $p^*(\cdot)$ are the solutions of the state and adjoint equations, respectively, given, in summary, at the end of Section 3 of this chapter. It should be noted that K(s) in Eq. 4.4.2 is given by $p^*(s) = K(s)h$ in System 4.4.1 with $x_d(t) = 0$, and likewise g(s) is given by $p^*(s) = g(s)$ with h = 0. In the sequel, we shall, for convenience, drop the asterisk superscript notation for the optimal quantities $x^*(\cdot)$ and $p^*(\cdot)$.


The operator K(s) can be shown to be self-adjoint by considering the scalar product $\langle K(s)h_1, h_2 \rangle_{H_0^m(D)}$ where $h_1$ and $h_2$ are initial conditions for System 4.4.1, with $x_d(t) = 0$ for all $t \in [0,T]$, which result in the solution pairs $\{x_1(\cdot), p_1(\cdot)\}$ and $\{x_2(\cdot), p_2(\cdot)\}$, respectively:

$$\int_s^T \left\langle \frac{dp_1}{dt} + A^* p_1 + Q x_1,\, x_2 \right\rangle_{H_0^m(D)} dt = \langle p_1(T), x_2(T) \rangle_{H_0^m(D)} - \langle p_1(s), x_2(s) \rangle_{H_0^m(D)} - \left\langle p_1,\, \frac{dx_2}{dt} - A x_2 \right\rangle_{L^2(s,T;H_0^m(D))} + \langle Q x_1, x_2 \rangle_{L^2(s,T;H_0^m(D))} \tag{4.4.4}$$

$$= \langle F x_1(T), x_2(T) \rangle_{H_0^m(D)} - \langle K(s)h_1, h_2 \rangle_{H_0^m(D)} + \langle p_1,\, B R^{-1} B^* p_2 \rangle_{L^2(s,T;H_0^m(D))} + \langle Q x_1, x_2 \rangle_{L^2(s,T;H_0^m(D))} = 0$$

and since the operators F, $B(t)R^{-1}(t)B^*(t)$, and Q(t) are all self-adjoint, K(s) is self-adjoint.

To show that K(s) is a positive operator, let us define the cost of starting at time s in System 4.4.1 with $x_d(t) = 0$ and with initial state $h \in H_0^m(D)$ and control $u(\cdot) \in L^2(s,T;U)$ as $J_s^h(u)$. If $u^*(\cdot) \in L^2(s,T;U)$ is the optimal control for this problem then $u^*(\cdot)$ satisfies the necessary condition

$$B^*(t)p(t) + R(t)u^*(t) = 0, \qquad t \in (s,T) \tag{4.4.5}$$

If $h_1 = h_2 = h$ (it follows that $x_1(\cdot) = x_2(\cdot)$ and $p_1(\cdot) = p_2(\cdot)$), the last equality of (4.4.4) may be written

$$\langle K(s)h, h \rangle_{H_0^m(D)} = \langle Qx, x \rangle_{L^2(s,T;H_0^m(D))} + \langle p,\, B R^{-1} B^* p \rangle_{L^2(s,T;H_0^m(D))} + \langle F x(T), x(T) \rangle_{H_0^m(D)} \tag{4.4.6}$$


But, by virtue of Eq. 4.4.5,

$$\int_s^T \langle R(t)u^*(t), u^*(t) \rangle_U \, dt = \int_s^T \langle B^*(t)p(t),\, R^{-1}(t)B^*(t)p(t) \rangle_U \, dt = \int_s^T \langle p(t),\, B(t)R^{-1}(t)B^*(t)p(t) \rangle_{H_0^m(D)} \, dt = \langle p,\, B R^{-1} B^* p \rangle_{L^2(s,T;H_0^m(D))} \tag{4.4.7}$$

so that $\langle K(s)h, h \rangle_{H_0^m(D)}$ is the optimal cost starting at time s with initial state h, or

$$\langle K(s)h, h \rangle_{H_0^m(D)} = J_s^h(u^*) \ge 0$$

proving the positivity of K(s).

The boundedness of K(s) follows from the fact that the transformations $h \to x(\cdot)$, $h \to p(\cdot)$, and $h \to x(T)$ are continuous in the strong topologies of their range spaces, so that

$$\|x(\cdot)\|_{L^2(s,T;H_0^m(D))} \le c_1 \|h\|_{H_0^m(D)}, \qquad \|p(\cdot)\|_{L^2(s,T;H_0^m(D))} \le c_2 \|h\|_{H_0^m(D)}$$

and $\|x(T)\|_{H_0^m(D)} \le c_3 \|h\|_{H_0^m(D)}$. From Definitions 3.1, 3.2, and 3.3 it is seen that the operators Q(t), R(t), and F are bounded so that

$$\langle Qx, x \rangle_{L^2(s,T;H_0^m(D))} \le M_1 c_1^2 \|h\|^2_{H_0^m(D)}, \qquad \langle p,\, B R^{-1} B^* p \rangle_{L^2(s,T;H_0^m(D))} \le M_2 c_2^2 \|h\|^2_{H_0^m(D)},$$

and $\langle x(T), F x(T) \rangle_{H_0^m(D)} \le M_3 c_3^2 \|h\|^2_{H_0^m(D)}$. This implies that

$$\langle K(s)h, h \rangle_{H_0^m(D)} \le (M_1 c_1^2 + M_2 c_2^2 + M_3 c_3^2) \|h\|^2_{H_0^m(D)} = c \|h\|^2_{H_0^m(D)} \tag{4.4.8}$$

proving boundedness of the operator K(s).

It will now be shown that K(s) is the solution of a nonlinear equation of the Riccati type and g(s) is the solution of a linear equation. Using Eq. 4.4.3 we can rewrite the system given in the summary of the preceding section as

$$\dot{x}(t) = A x(t) - B(t)R^{-1}(t)B^*(t)[K(t)x(t) + g(t)]$$

and

$$\frac{dp(t)}{dt} = \dot{K}(t)x(t) + K(t)\dot{x}(t) + \dot{g}(t) = -A^*[K(t)x(t) + g(t)] - Q(t)x(t) + Q(t)x_d(t) \tag{4.4.9}$$

and

$$p(T) = K(T)x(T) + g(T) = F x(T) - F x_d(T); \qquad x(0) = x_0$$

The two equations in (4.4.9) can be combined to yield

$$[\dot{K}(t) + K(t)A + A^* K(t) - K(t)B(t)R^{-1}(t)B^*(t)K(t) + Q(t)]\, x(t) = [-\dot{g}(t) - A^* g(t) + K(t)B(t)R^{-1}(t)B^*(t)g(t) + Q(t)x_d(t)] \tag{4.4.10}$$

Since x(t) is arbitrary in the sense that it depends on an arbitrary choice of $x_0$, the only way equality can be achieved in (4.4.10) is if the terms in each of the two square brackets sum to zero, or, equivalently, if the following two differential equations are satisfied

$$\dot{K}(t) = -K(t)A - A^* K(t) + K(t)B(t)R^{-1}(t)B^*(t)K(t) - Q(t); \qquad K(T) = F \tag{4.4.11}$$

and

$$\dot{g}(t) = -A^* g(t) + K(t)B(t)R^{-1}(t)B^*(t)g(t) + Q(t)x_d(t); \qquad g(T) = -F x_d(T) \tag{4.4.12}$$
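
As a purely illustrative aid, the following sketch integrates the scalar analog of Eq. 4.4.11 backward from the terminal condition K(T) = F. The reduction to scalars is an assumption made here (the names `a`, `b`, `q`, `r`, `f_T` are hypothetical stand-ins for A, B, Q, R, F); in that case the equation reads $\dot{k} = -2ak + (b^2/r)k^2 - q$. For a = 0 and q = 0 the closed form $k(t) = f_T / (1 + f_T (b^2/r)(T-t))$ is available as a check.

```python
def riccati_rhs(k, a, s, q):
    """Scalar analog of Eq. 4.4.11: dk/dt = -2 a k + s k^2 - q, with s = b^2 / r."""
    return -2.0 * a * k + s * k * k - q


def solve_riccati(a, b, q, r, f_T, T, n=1000):
    """Integrate backward from k(T) = f_T with classical RK4; returns k on a grid."""
    s = b * b / r
    dt = T / n
    ks = [0.0] * (n + 1)
    ks[n] = f_T
    for i in range(n, 0, -1):
        k = ks[i]
        # RK4 with a negative step (-dt), marching from t_i to t_{i-1}
        k1 = riccati_rhs(k, a, s, q)
        k2 = riccati_rhs(k - 0.5 * dt * k1, a, s, q)
        k3 = riccati_rhs(k - 0.5 * dt * k2, a, s, q)
        k4 = riccati_rhs(k - dt * k3, a, s, q)
        ks[i - 1] = k - dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return ks


# Check case with a = 0, q = 0: k(0) should equal f_T / (1 + f_T * s * T)
ks_check = solve_riccati(a=0.0, b=1.0, q=0.0, r=1.0, f_T=2.0, T=1.0)
```

The backward direction of integration reflects the fact that the only boundary datum for K is the terminal value F.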


We may summarize in part what has been shown above by stating the following theorem:

Theorem 4.6: The optimal control $u^*(\cdot) \in L^2(0,T;U)$ for the parabolic control problem specified in Definition 3.4 is given by the feedback form:

$$u^*(t) = -R^{-1}(t)B^*(t)[K(t)x(t) + g(t)] \tag{4.4.13}$$

where K(t), $t \in [0,T]$, is the bounded, positive self-adjoint solution of Eq. 4.4.11 and g(t) is the solution of Eq. 4.4.12.

It remains to determine the relationship between the cost of starting at the initial state h at time s and the operator K(s) and function g(s) given above. This is stated as:

Theorem 4.7: The value of the parabolic cost criterion attained by System 4.4.1 (the optimal system over (s,T)) is given by the expression:

$$J = \langle K(s)h, h \rangle_{H_0^m(D)} + 2 \langle g(s), h \rangle_{H_0^m(D)} + \phi(s)$$

where K(s) is the solution of (4.4.11), g(s) is the solution of (4.4.12) and $\phi(s)$ is the solution of

$$\dot{\phi}(t) = -\langle x_d(t), Q(t)x_d(t) \rangle_{H_0^m(D)} + \langle g(t),\, B(t)R^{-1}(t)B^*(t)g(t) \rangle_{H_0^m(D)}; \qquad \phi(T) = \langle F x_d(T), x_d(T) \rangle \tag{4.4.14}$$
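Proof: From Eq. 4.4.1 we may write*

$$\int_s^T \langle Q(x - x_d),\, x - x_d \rangle \, dt = \int_s^T \left\langle -\frac{dp}{dt} - A^* p,\, x - x_d \right\rangle dt = \int_s^T \left\langle -\frac{dp}{dt} - A^* p,\, x \right\rangle dt - \int_s^T \left\langle -\frac{dp}{dt} - A^* p,\, x_d \right\rangle dt \tag{4.4.15}$$

* All inner products are defined on $H_0^m(D)$ unless otherwise specified.

Integrating the first term by parts, we obtain

$$\int_s^T \left\langle -\frac{dp}{dt} - A^* p,\, x \right\rangle dt = \langle p(s), x(s) \rangle - \langle p(T), x(T) \rangle + \int_s^T \langle p,\, \dot{x} - A x \rangle \, dt \tag{4.4.16}$$

$$= \langle K(s)h + g(s),\, h \rangle - \langle F(x(T) - x_d(T)),\, x(T) \rangle - \int_s^T \langle p,\, B R^{-1} B^* p \rangle \, dt$$

Now, the second term in Eq. 4.4.16 may be written

$$-\langle F(x(T) - x_d(T)),\, x(T) \rangle = -\langle F(x(T) - x_d(T)),\, x(T) - x_d(T) \rangle - \langle F(x(T) - x_d(T)),\, x_d(T) \rangle \tag{4.4.17}$$

Moreover, from Eq. 4.4.7 we see that

$$-\int_s^T \langle p,\, B R^{-1} B^* p \rangle \, dt = -\int_s^T \langle u(t), R(t)u(t) \rangle_U \, dt$$

so that, by virtue of the equation for the cost functional J (Eq. 3.3.1) and Eqs. 4.4.16 and 4.4.17, Eq. 4.4.15 may be written

$$J = \langle K(s)h, h \rangle + \langle g(s), h \rangle - \langle F(x(T) - x_d(T)),\, x_d(T) \rangle + \int_s^T \left\langle \frac{dp}{dt} + A^* p,\, x_d \right\rangle dt \tag{4.4.18}$$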

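Examining the last term of Eq. 4.4.18,

$$\int_s^T \left\langle \frac{dp}{dt} + A^* p,\, x_d \right\rangle dt = -\int_s^T \langle Q(x - x_d),\, x_d \rangle \, dt = \int_s^T \langle Q x_d, x_d \rangle \, dt - \int_s^T \langle Q x, x_d \rangle \, dt \tag{4.4.19}$$

But, by the differential equation for g, Eq. 4.4.12,

$$-\int_s^T \langle Q x, x_d \rangle \, dt = -\int_s^T \langle Q x_d, x \rangle \, dt = -\int_s^T \langle \dot{g} + A^* g - K B R^{-1} B^* g,\, x \rangle \, dt$$

$$= \langle g(s), x(s) \rangle - \langle g(T), x(T) \rangle + \int_s^T \langle g,\, \dot{x} - A x + B R^{-1} B^* K x \rangle \, dt$$

Since $g(T) = -F x_d(T)$ and $\dot{x} - A x + B R^{-1} B^* K x = -B R^{-1} B^* g$, we obtain

$$-\int_s^T \langle Q x, x_d \rangle \, dt = \langle g(s), h \rangle + \langle F x_d(T), x(T) \rangle - \int_s^T \langle g,\, B R^{-1} B^* g \rangle \, dt \tag{4.4.20}$$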

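Combining Eqs. 4.4.18, 4.4.19, and 4.4.20 yields

$$J = \langle K(s)h, h \rangle + 2\langle g(s), h \rangle - \langle F(x(T) - x_d(T)),\, x_d(T) \rangle + \langle F x_d(T), x(T) \rangle + \int_s^T \langle Q x_d, x_d \rangle \, dt - \int_s^T \langle g,\, B R^{-1} B^* g \rangle \, dt$$

$$= \langle K(s)h, h \rangle + 2\langle g(s), h \rangle + \langle F x_d(T), x_d(T) \rangle + \int_s^T \langle Q x_d, x_d \rangle \, dt - \int_s^T \langle g,\, B R^{-1} B^* g \rangle \, dt$$

$$= \langle K(s)h, h \rangle + 2\langle g(s), h \rangle + \phi(s)$$

where $\phi(s)$ satisfies Eq. 4.4.14.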
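
The cost formula of Theorem 4.7 can be checked numerically in a scalar setting, which is an illustrative assumption made here rather than a case treated in the text. With $x_d = 0$ (so that g = 0 and $\phi = 0$) the theorem predicts $J = k(0)h^2$ for the closed-loop system started at s = 0. The sketch below solves the scalar Riccati equation backward, simulates the state under the feedback $u = -(b/r)k(t)x$, accumulates the running cost, and returns both numbers; all parameter names are hypothetical.

```python
def riccati_grid(a, s, q, f_T, T, m):
    """Backward RK4 for dk/dt = -2 a k + s k^2 - q, k(T) = f_T, on m steps."""
    def rhs(k):
        return -2.0 * a * k + s * k * k - q
    dt = T / m
    ks = [0.0] * (m + 1)
    ks[m] = f_T
    for i in range(m, 0, -1):
        k = ks[i]
        k1 = rhs(k)
        k2 = rhs(k - 0.5 * dt * k1)
        k3 = rhs(k - 0.5 * dt * k2)
        k4 = rhs(k - dt * k3)
        ks[i - 1] = k - dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return ks


def closed_loop_cost(a, b, q, r, f_T, h, T, n=500):
    """Return (simulated cost, predicted cost k(0) h^2) for x_d = 0."""
    s = b * b / r
    ks = riccati_grid(a, s, q, f_T, T, 2 * n)  # half-step grid gives RK4 midpoints
    dt = T / n

    def rhs(k, x):
        # closed-loop state rate and running cost q x^2 + r u^2 with u = -(b/r) k x
        return (a - s * k) * x, (q + s * k * k) * x * x

    x, cost = h, 0.0
    for i in range(n):
        k0, km, k1 = ks[2 * i], ks[2 * i + 1], ks[2 * i + 2]
        d1 = rhs(k0, x)
        d2 = rhs(km, x + 0.5 * dt * d1[0])
        d3 = rhs(km, x + 0.5 * dt * d2[0])
        d4 = rhs(k1, x + dt * d3[0])
        cost += dt * (d1[1] + 2.0 * d2[1] + 2.0 * d3[1] + d4[1]) / 6.0
        x += dt * (d1[0] + 2.0 * d2[0] + 2.0 * d3[0] + d4[0]) / 6.0
    return cost + f_T * x * x, ks[0] * h * h


simulated, predicted = closed_loop_cost(a=-1.0, b=1.0, q=1.0, r=0.5,
                                        f_T=0.3, h=2.0, T=1.0)
```

Agreement of the two returned values to within discretization error is the scalar counterpart of the identity $J = \langle K(s)h, h \rangle$.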

Thus, we have achieved what was set out to be done in this section. The optimal control for the parabolic optimal control problem was shown to be a linear feedback control with a positive, bounded and self-adjoint feedback operator. It was also shown, by means of Theorem 4.7, that the optimal cost is related to this feedback operator. It is important to note that the existence of this operator K(t) is guaranteed by the strong continuity of the transformation $h \to \{x(\cdot), p(\cdot)\}$ in $W(0,T) \times W(0,T)$. Once existence is guaranteed, it is a trivial matter to determine what equation the operator K(t) must satisfy. In the case of the operators treated in the next section, namely, the elliptic operators which are infinitesimal generators of semigroups, the transformation $h \to \{x(\cdot), p(\cdot)\}$ cannot be proved continuous, so that existence of the optimal feedback operator K(t) must be proved through other means.

4.5 NECESSARY CONDITIONS FOR OPTIMALITY-- 

INFINITESIMAL GENERATOR CASE 

The necessary conditions for optimality have been derived for the case of coercive system operators and have been summarized at the end of Section 3 of this chapter; the resulting feedback form of the optimal control has been given in Section 4. The existence of the feedback operator K(t) and of the related vector function g(t) and scalar function $\phi(t)$ are a consequence of the strong continuity in $W(0,T) \times W(0,T)$ of the transformation of the initial conditions into the solution $\{x(\cdot), p(\cdot)\}$ of the canonical system of Eqs. 4.4.1. Such a canonical system may be defined in the case where the system operator in the parabolic system Eq. 3.2.2 is a strongly elliptic operator satisfying Inequality 2.7.1, or, equivalently, is the infinitesimal generator of a semigroup of operators. However, in this case, the transformation from the initial conditions to the canonical solution set $\{x(\cdot), p(\cdot)\}$ can only be shown to be strongly continuous in $L^2(0,T;H_0^m(D)) \times L^2(0,T;H_0^m(D))$. Although this continuity feature was enough to guarantee existence and uniqueness of optimal controls for this type of system operator (as was shown in Section 2 of this chapter), it is not enough to guarantee the existence and uniqueness of a bounded, self-adjoint positive feedback operator K(t) and the related functions g(t) and $\phi(t)$.

It is the purpose of this section and of the following section to show that the optimal control for parabolic systems with this class of operators has precisely the same feedback form given for parabolic systems with coercive system operators, namely, Eq. 4.4.13. In this section, we shall show that if a bounded solution K(t) exists for the Riccati operator equation (4.4.11), then the feedback form in Eq. 4.4.13 is the optimal control for parabolic systems with system operators which are infinitesimal generators of semigroups of operators. It will also be shown that the optimal cost function for parabolic systems with coercive system operators, given in Theorem 4.7, is also the optimal cost function for this class of systems. In Section 6 the important question of existence and uniqueness of bounded solutions to the Riccati operator equation will be considered.

Let us, for convenience, rewrite the cost criterion (3.3.1)

$$J = \langle x - x_d,\, Q(x - x_d) \rangle_{L^2(0,T;H_0^m(D))} + \langle u, Ru \rangle_{L^2(0,T;U)} + \langle x(T) - x_d(T),\, F(x(T) - x_d(T)) \rangle_{H_0^m(D)} \tag{3.3.1}$$

We shall show that the minimum value which J can attain is

$$J_{\min} = \langle K(0)x(0), x(0) \rangle_{H_0^m(D)} + 2\langle g(0), x(0) \rangle_{H_0^m(D)} + \phi(0) \tag{4.5.1}$$

where K(t), g(t), and $\phi(t)$ are given by Eqs. 4.4.11, 4.4.12, and 4.4.14, respectively. Let us make the assumption that Eq. 4.4.11 has a bounded solution K(t) defined on [0,T]. To show that this implies the existence of g(t) and $\phi(t)$ we prove the following lemma:

Lemma 4.1: If a bounded solution K(t) of Eq. 4.4.11 exists for all $t \in [0,T]$, then a unique solution g(t) of Eq. 4.4.12 and, consequently, a unique solution $\phi(t)$ of Eq. 4.4.14 exist in the case of parabolic systems with system operators which satisfy Inequality 2.7.1.

Proof: Rewriting Eq. 4.4.12

$$\frac{dg(t)}{dt} = -A^* g(t) + K(t)B(t)R^{-1}(t)B^*(t)g(t) + Q(t)x_d(t); \qquad g(T) = -F x_d(T) \tag{4.4.12}$$

Let us make the transformation $t \to T - s$, so that Eq. 4.4.12 becomes

$$\frac{dg(s)}{ds} = A^* g(s) - K(s)B(s)R^{-1}(s)B^*(s)g(s) - Q(s)x_d(s); \qquad g(0) = -F x_d(0)$$

Since the operator A is the infinitesimal generator of a strongly continuous semigroup of operators $\{\Phi(t)\}_{t \in [0,T]}$ as described in Section 7 of Chapter II, the adjoint operator $A^*$ is the infinitesimal generator of the strongly continuous* semigroup of operators $\{\Phi^*(t)\}_{t \in [0,T]}$ as shown by Hille and Phillips (see Ref. 21, p. 426). We may thus apply the variation of constants formula, Eq. 2.8.1, to obtain

$$g(s) = a(s) - \int_0^s \Phi^*(s - \sigma)K(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)g(\sigma) \, d\sigma \tag{4.5.2}$$

where

$$a(s) = \Phi^*(s)g(0) - \int_0^s \Phi^*(s - \sigma)Q(\sigma)x_d(\sigma) \, d\sigma$$

To show that a solution g(s), $s \in [0,T]$, exists for Eq. 4.5.2 we shall apply the well-known Picard method of successive approximations (see Ref. 26, p. 6). Form the sequence of Sobolev-space valued functions $\{g_i\}_{i=0}^{\infty}$ defined by:

$$g_0(s) = a(s)$$

$$g_{i+1}(s) = a(s) - \int_0^s \Phi^*(s - \sigma)K(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)g_i(\sigma) \, d\sigma$$

We would like to show that the sequence $\{g_i\}$ converges in $L^2(0,T;H_0^m(D))$ to some limit $g(\cdot)$. The convergence of this sequence depends on the convergence of the infinite series

$$h(s) = \sum_{i=0}^{\infty} \big( g_{i+1}(s) - g_i(s) \big)$$

* Strong continuity is a result of the reflexivity of the space $L^2(0,T;H_0^m(D))$ on which $\Phi^*(t)$ is defined.

The partial sum of this series has the form

$$h_N(s) = \sum_{i=0}^{N} \big( g_{i+1}(s) - g_i(s) \big) = g_{N+1}(s) - g_0(s)$$

and, so, the series converges in $L^2(0,T;H_0^m(D))$ if and only if the sequence $\{g_i\}$ converges in $L^2(0,T;H_0^m(D))$. The series will converge if $\sum_{i=0}^{\infty} \|g_{i+1}(\cdot) - g_i(\cdot)\|_{L^2(0,T;H_0^m(D))}$ converges. Now,

$$g_{i+1}(s) - g_i(s) = -\int_0^s \Phi^*(s - \sigma)K(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)[g_i(\sigma) - g_{i-1}(\sigma)] \, d\sigma$$

with the result that for $i \ge 1$

$$\|g_{i+1}(s) - g_i(s)\|_{H_0^m(D)} \le \int_0^s \|\Phi^*(s - \sigma)K(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)\| \, \|g_i(\sigma) - g_{i-1}(\sigma)\|_{H_0^m(D)} \, d\sigma$$

By an argument similar to that appearing in the proof of Theorem 4.3 we may write the above inequality as

$$\|g_{i+1}(s) - g_i(s)\|_{H_0^m(D)} \le c \int_0^s \|g_i(\sigma) - g_{i-1}(\sigma)\|_{H_0^m(D)} \, d\sigma$$

Moreover, since

$$\|g_1(s) - g_0(s)\|_{H_0^m(D)} \le \int_0^s \|\Phi^*(s - \sigma)K(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)\| \, \|a(\sigma)\| \, d\sigma$$


and since a(s), given in Eq. 4.5.2, is bounded in norm on the finite interval [0,T] by $\|a(s)\| \le a$,

$$\|g_1(s) - g_0(s)\|_{H_0^m(D)} \le a c s$$

From this it can be shown by induction that

$$\|g_{i+1}(s) - g_i(s)\|_{H_0^m(D)} \le a \frac{(cs)^{i+1}}{(i+1)!}$$

and, thus,

$$\|g_{i+1}(\cdot) - g_i(\cdot)\|_{L^2(0,T;H_0^m(D))} \le \frac{a}{c} \frac{(cT)^{i+2}}{(i+2)!}$$

so that

$$\sum_{i=0}^{\infty} \|g_{i+1}(\cdot) - g_i(\cdot)\|_{L^2(0,T;H_0^m(D))} \le \frac{a}{c} \left[ e^{cT} - (1 + cT) \right]$$

and convergence is guaranteed for T finite. Thus, existence of a solution g(t) of Eq. 4.4.12 is proved. Uniqueness will not be proved, but it is a straightforward matter to modify Bellman's uniqueness proof (see Ref. 26, p. 8). The existence and uniqueness of $\phi(s)$ follows directly from the existence and uniqueness of g(s).
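
The successive-approximation argument can be tried out on a scalar stand-in for Eq. 4.5.2. The sketch below is an illustration under assumptions that are not in the text: the semigroup factor is replaced by 1 and the operator product $K B R^{-1} B^*$ by a hypothetical constant `c`, so the equation becomes $g(s) = a(s) - c \int_0^s g(\sigma) \, d\sigma$. With $a(s) \equiv 1$ the exact solution is $e^{-cs}$, and the successive differences shrink factorially, as in the bound derived above.

```python
import math


def picard(a, c, T, n=400, iterations=30):
    """Successive approximations for g(s) = a(s) - c * int_0^s g(sigma) d sigma.

    Returns the grid, the last iterate, and the sup-norm differences
    between consecutive iterates.
    """
    h = T / n
    grid = [j * h for j in range(n + 1)]
    a_vals = [a(s) for s in grid]
    g = a_vals[:]                      # g_0 = a
    diffs = []
    for _ in range(iterations):
        new = [a_vals[0]]
        integral = 0.0
        for j in range(1, n + 1):
            integral += 0.5 * h * (g[j - 1] + g[j])   # trapezoid rule
            new.append(a_vals[j] - c * integral)
        diffs.append(max(abs(x - y) for x, y in zip(new, g)))
        g = new
    return grid, g, diffs


grid, g_limit, diffs = picard(lambda s: 1.0, c=2.0, T=1.0)
```

The first difference equals cT, and later differences decay roughly like $(cT)^{i+1}/(i+1)!$, which is why convergence holds for any finite T rather than only for small T.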


We shall now show that the optimal control is given by Eq. 4.4.13 and the optimal cost function is given by Eq. 4.5.1.

Theorem 4.8: If a bounded self-adjoint operator solution K(t) of Eq. 4.4.11 exists, then the optimal control is given by

$$u^*(t) = -R^{-1}(t)B^*(t)[K(t)x(t) + g(t)] \tag{4.4.13}$$

and the optimal cost function is given by

$$J_{\min} = \langle K(0)x(0), x(0) \rangle_{H_0^m(D)} + 2\langle g(0), x(0) \rangle_{H_0^m(D)} + \phi(0)$$

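Proof: Since g(t) and $\phi(t)$ exist by Lemma 4.1, let us write the identity

$$\int_0^T \frac{d}{dt} \Big[ \langle K(t)x(t), x(t) \rangle_{H_0^m(D)} + 2\langle g(t), x(t) \rangle_{H_0^m(D)} + \phi(t) \Big] dt - \Big[ \langle K(t)x(t), x(t) \rangle_{H_0^m(D)} + 2\langle g(t), x(t) \rangle_{H_0^m(D)} + \phi(t) \Big]_0^T = 0 \tag{4.5.3}$$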


Performing the differentiation inside the integral and using Eqs. 4.4.11, 4.4.12, and 4.4.14 to eliminate $\dot{K}(t)$, $\dot{g}(t)$, and $\dot{\phi}(t)$, respectively, Eq. 4.5.3 becomes

$$\int_0^T \langle K(t)B(t)R^{-1}(t)B^*(t)[K(t)x(t) + 2g(t)],\, x(t) \rangle_{H_0^m(D)} \, dt + 2\int_0^T \langle B^*(t)[K(t)x(t) + g(t)],\, u(t) \rangle_U \, dt$$

$$+ \int_0^T \langle B(t)R^{-1}(t)B^*(t)g(t),\, g(t) \rangle_{H_0^m(D)} \, dt - \int_0^T \langle Q(t)(x(t) - x_d(t)),\, x(t) - x_d(t) \rangle_{H_0^m(D)} \, dt$$

$$- \Big[ \langle K(t)x(t), x(t) \rangle_{H_0^m(D)} + 2\langle g(t), x(t) \rangle_{H_0^m(D)} + \phi(t) \Big]_0^T = 0 \tag{4.5.4}$$

If Eq. 4.5.4 is added to the cost criterion, Eq. 3.3.1, and if the equality

$$\langle K(T)x(T), x(T) \rangle_{H_0^m(D)} + 2\langle g(T), x(T) \rangle_{H_0^m(D)} + \phi(T) = \langle x(T) - x_d(T),\, F(x(T) - x_d(T)) \rangle_{H_0^m(D)}$$

which follows from the terminal conditions of Eqs. 4.4.11, 4.4.12, and 4.4.14, is used, we obtain

$$J = \langle K(0)x(0), x(0) \rangle_{H_0^m(D)} + 2\langle g(0), x(0) \rangle_{H_0^m(D)} + \phi(0) + \int_0^T \big\langle R(t)[R^{-1}(t)B^*(t)(K(t)x(t) + g(t)) + u(t)],\, R^{-1}(t)B^*(t)(K(t)x(t) + g(t)) + u(t) \big\rangle_U \, dt \tag{4.5.5}$$

Since R(t) is a strictly positive operator, the integral term must be greater than or equal to zero, the latter occurring if the control $u(\cdot)$ is chosen to be

$$u(t) = -R^{-1}(t)B^*(t)[K(t)x(t) + g(t)]$$

and the minimizing value of the cost criterion is

$$J_{\min} = \langle K(0)x(0), x(0) \rangle_{H_0^m(D)} + 2\langle g(0), x(0) \rangle_{H_0^m(D)} + \phi(0)$$

Thus, we have shown, in the case of parabolic systems with system operators which are infinitesimal generators of semigroups, that the assumption of the existence of a positive self-adjoint solution K(t) of the Riccati operator equation yields precisely the same results for the characterization of the optimal control and the optimal cost function as were obtained in Section 4 for the case of coercive system operators. All of this motivates a vital question, namely, under what circumstances (if any) do solutions of the Riccati operator equation exist for the class of system operators under consideration? This question is treated in the next section of this chapter.

4.6 EXISTENCE OF SOLUTIONS OF THE RICCATI 
OPERATOR EQUATION 

In this section it will be shown that bounded, positive, self-adjoint solutions K(t) of the Riccati operator equation (4.4.11) exist in the case where the system operator A is the infinitesimal generator of a semigroup of operators. This will be achieved by using an extension of the method of quasilinearization (see Ref. 27, p. 19) used by D. Kleinman (Ref. 28) to prove the existence of a solution of the matrix Riccati equation for finite dimensional systems. In brief, this method consists of demonstrating existence and uniqueness of a solution of an auxiliary linear operator equation and using this equation in an iterative fashion to prove existence of a solution of the Riccati operator equation.

If the differential operator A is assumed to satisfy the inequality, Eq. 2.7.1, then the operator $A_0$, defined in Section 2.7, is the infinitesimal generator of the strongly continuous semigroup of operators $\{\Phi(t)\}_{t \in [0,T]}$, and the solution of the parabolic equation

$$\dot{x}(t) = A x(t) + B(t)u(t); \qquad x(0) = x_0 \in Do(A_0)$$

may be written as

$$x(t) = \Phi(t)x_0 + \int_0^t \Phi(t - \sigma)B(\sigma)u(\sigma) \, d\sigma \tag{4.6.1}$$

where $B(\sigma) \in \mathcal{L}(U; H_0^m(D))$ and $u(\sigma) \in U$. We shall have need in this section to discuss solutions of the linear operator equation

$$\dot{V}(t) = -V(t)A - A^* V(t) - W(t); \qquad V(T) = F \tag{4.6.2}$$

where V(t) and W(t) are assumed to be bounded linear operators on [0,T] with domain equal to $Do(A_0)$ and F is the bounded self-adjoint terminal state-weighting operator defined in Definition 3.3. For this equation we state the following lemma:

Lemma 4.2: The solution of the linear equation (4.6.2) is given uniquely by

$$V(t) = \Phi^*(T - t)F\Phi(T - t) + \int_t^T \Phi^*(\sigma - t)W(\sigma)\Phi(\sigma - t) \, d\sigma \tag{4.6.3}$$

Proof: If A is the infinitesimal generator of a semigroup of operators, as described above, then its adjoint $A^*$ is also the infinitesimal generator of a semigroup $\{S(t)\}_{t \in [0,T]}$ of operators, and it is easily seen that this semigroup has the property

$$\frac{d}{dt} S(t)x_0 = A^* S(t)x_0 \quad \forall x_0 \in Do(A_0) \tag{4.6.4}$$

Moreover, since S(t) is bounded on [0,T], we have

$$\frac{d}{dt} S^*(t)x_0 = \left( \frac{d}{dt} S(t) \right)^{\!*} x_0 = S^*(t)A x_0 \quad \forall x_0 \in Do(A_0) \tag{4.6.5}$$

It should also be noted that, from the discussion in the proof of Lemma 4.1, $S(t) = \Phi^*(t)$ and $S^*(t) = \Phi(t)$. If we let $x_0$ be an arbitrary element of $Do(A_0)$, Eq. 4.6.3 may be written

$$V(t)x_0 = S(T - t)F S^*(T - t)x_0 + \int_t^T S(\sigma - t)W(\sigma)S^*(\sigma - t)x_0 \, d\sigma$$

Differentiating this expression we obtain

$$\dot{V}(t)x_0 = \left[ \frac{d}{dt} S(T - t) \right] F S^*(T - t)x_0 + S(T - t)F \left[ \frac{d}{dt} S^*(T - t) \right] x_0 - S(0)W(t)S^*(0)x_0$$

$$+ \int_t^T \left[ \frac{d}{dt} S(\sigma - t) \right] W(\sigma)S^*(\sigma - t)x_0 \, d\sigma + \int_t^T S(\sigma - t)W(\sigma) \left[ \frac{d}{dt} S^*(\sigma - t) \right] x_0 \, d\sigma$$

$$= -A^* \left[ S(T - t)F S^*(T - t) + \int_t^T S(\sigma - t)W(\sigma)S^*(\sigma - t) \, d\sigma \right] x_0 - \left[ S(T - t)F S^*(T - t) + \int_t^T S(\sigma - t)W(\sigma)S^*(\sigma - t) \, d\sigma \right] A x_0 - W(t)x_0$$

or,

$$\dot{V}(t)x_0 = -V(t)A x_0 - A^* V(t)x_0 - W(t)x_0$$

Since $x_0$ is arbitrary, Eq. 4.6.3 holds under the assumption that $Do(V) = Do(A_0)$, demonstrating existence and uniqueness.
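
The representation of Lemma 4.2 can be sanity-checked in the scalar case, where the semigroup is simply $\Phi(t) = e^{at}$. This reduction, and the names `a`, `F`, `W` used below, are illustrative assumptions and not part of the lemma. The sketch evaluates the scalar form of Eq. 4.6.3 by Simpson's rule and then measures the residual of the scalar form of Eq. 4.6.2, $\dot{V} + 2aV + W$, using a central difference for the derivative.

```python
import math


def v_formula(t, a, F, W, T, n=2000):
    """Scalar form of Eq. 4.6.3 with Phi(t) = exp(a t); Simpson's rule, n even."""
    h = (T - t) / n
    total = 0.0
    for j in range(n + 1):
        sigma = t + j * h
        weight = 1 if j in (0, n) else (4 if j % 2 else 2)
        total += weight * math.exp(2.0 * a * (sigma - t)) * W(sigma)
    return math.exp(2.0 * a * (T - t)) * F + total * h / 3.0


def residual(t, a, F, W, T, eps=1e-4):
    """dV/dt + 2 a V + W at time t, with dV/dt from a central difference."""
    dv = (v_formula(t + eps, a, F, W, T) - v_formula(t - eps, a, F, W, T)) / (2.0 * eps)
    return dv + 2.0 * a * v_formula(t, a, F, W, T) + W(t)


def weight_fn(s):
    """A hypothetical time-varying weighting W(s)."""
    return 1.0 + s


res = residual(0.3, a=-1.5, F=0.7, W=weight_fn, T=1.0)
v_terminal = v_formula(1.0, a=-1.5, F=0.7, W=weight_fn, T=1.0)
```

A residual near zero together with the terminal value V(T) = F indicates that the formula satisfies both the differential equation and its boundary condition in this scalar setting.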





We shall want to use this lemma to prove the existence of a solution of the auxiliary equation discussed in the introductory paragraph of this section. Let us state and prove the following:

Theorem 4.9: If L(t) is a bounded positive self-adjoint operator, defined on [0,T], then there exists a unique positive, self-adjoint solution $V_L(t)$ of the equation

$$\dot{V}_L(t) = -V_L(t)[A - B(t)R^{-1}(t)B^*(t)L(t)] - [A^* - L(t)B(t)R^{-1}(t)B^*(t)]V_L(t) - L(t)B(t)R^{-1}(t)B^*(t)L(t) - Q(t); \qquad V_L(T) = F \tag{4.6.6}$$

Moreover, if in the parabolic control problem specified by Definition 3.4 we let $x_d(t) = 0$ on [0,T] and require the control to be of the form

$$u(t) = -R^{-1}(t)B^*(t)L(t)x(t) \tag{4.6.7}$$

then the cost criterion has the value

$$J = \langle V_L(0)x_0, x_0 \rangle$$
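
The cost statement of Theorem 4.9 can be illustrated in a scalar setting with a constant gain, an assumption made here only to obtain a closed form; the names `a`, `b`, `q`, `r`, `f_T`, `l` are hypothetical stand-ins for A, B, Q, R, F, L. The scalar form of Eq. 4.6.6 is $\dot{v} = -2(a - s l)v - s l^2 - q$ with $s = b^2/r$, and under $u = -(b/r) l x$ the state is $x(t) = x_0 e^{(a - sl)t}$, so the cost can be computed directly and compared with $v(0)x_0^2$.

```python
import math


def v_L_at_zero(a, b, q, r, f_T, l, T, n=2000):
    """Backward RK4 for the scalar form of Eq. 4.6.6 with constant gain l."""
    s = b * b / r

    def rhs(v):
        return -2.0 * (a - s * l) * v - s * l * l - q

    dt, v = T / n, f_T
    for _ in range(n):
        k1 = rhs(v)
        k2 = rhs(v - 0.5 * dt * k1)
        k3 = rhs(v - 0.5 * dt * k2)
        k4 = rhs(v - dt * k3)
        v -= dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return v


def cost_of_fixed_gain(a, b, q, r, f_T, l, x0, T):
    """Closed-form cost of u = -(b/r) l x when x_d = 0 (a - s l assumed nonzero)."""
    s = b * b / r
    alpha = a - s * l                      # closed-loop rate
    growth = math.exp(2.0 * alpha * T)     # x(T)^2 / x0^2
    running = (q + s * l * l) * (growth - 1.0) / (2.0 * alpha)
    return (running + f_T * growth) * x0 * x0


params = dict(a=-1.0, b=1.0, q=1.0, r=0.5, f_T=0.3, l=0.4)
predicted_cost = v_L_at_zero(T=1.0, **params) * 2.0 ** 2
direct_cost = cost_of_fixed_gain(x0=2.0, T=1.0, **params)
```

The gain `l` here is arbitrary rather than optimal; the point of the comparison is that the linear equation prices any fixed feedback gain, which is what makes it usable as the auxiliary equation in an iterative scheme.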

Proof: We shall prove the existence of the solution $V_L(t)$ in much the same way that existence was proved for g(t) in Lemma 4.1, that is, by means of successive approximations.

For any arbitrary $x_0 \in Do(A_0)$, $V_L(t)x_0$ must satisfy, according to Lemma 4.2, the following equation:

$$V_L(t)x_0 = \Phi^*(T - t)F\Phi(T - t)x_0 + \int_t^T \Phi^*(\sigma_1 - t)[L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + Q(\sigma_1)]\Phi(\sigma_1 - t)x_0 \, d\sigma_1$$

$$- \int_t^T \Phi^*(\sigma_1 - t)[V_L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)V_L(\sigma_1)]\Phi(\sigma_1 - t)x_0 \, d\sigma_1 \tag{4.6.8}$$

If we define M(t) to be the bounded operator

$$M(t) = \Phi^*(T - t)F\Phi(T - t) + \int_t^T \Phi^*(\sigma_1 - t)[L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + Q(\sigma_1)]\Phi(\sigma_1 - t) \, d\sigma_1 \tag{4.6.9}$$

then Eq. 4.6.8 can be written in the form

$$V_L(t)x_0 = M(t)x_0 - \int_t^T \Phi^*(\sigma_1 - t)[V_L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)V_L(\sigma_1)]\Phi(\sigma_1 - t)x_0 \, d\sigma_1$$

Forming the sequence $\{V_L^i(t)\}_{i=0}^{\infty}$, where $V_L^0(t) = M(t)$, and

$$V_L^i(t)x_0 = M(t)x_0 - \int_t^T \Phi^*(\sigma_1 - t)[V_L^{i-1}(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)V_L^{i-1}(\sigma_1)]\Phi(\sigma_1 - t)x_0 \, d\sigma_1 \quad \text{for } i \ge 1$$

we obtain

$$\big( V_L^{i+1}(t) - V_L^i(t) \big)x_0 = -\int_t^T \Phi^*(\sigma_1 - t)\big[ (V_L^i(\sigma_1) - V_L^{i-1}(\sigma_1))B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)L(\sigma_1) + L(\sigma_1)B(\sigma_1)R^{-1}(\sigma_1)B^*(\sigma_1)(V_L^i(\sigma_1) - V_L^{i-1}(\sigma_1)) \big]\Phi(\sigma_1 - t)x_0 \, d\sigma_1 \tag{4.6.10}$$

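From Eq. 4.6.10 and by the self-adjointness of $V_L^i(\sigma_1)$, $L(\sigma_1)$, and $R^{-1}(\sigma_1)$ we see that

$$\begin{aligned}
\|(V_L^{i+1}(t) - V_L^i(t))x_0\|_{H_0^m(D)} &\le 2\int_t^T \|\Phi^*(\sigma-t)L(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)\|\;\|(V_L^i(\sigma)-V_L^{i-1}(\sigma))\Phi(\sigma-t)x_0\|_{H_0^m(D)}\,d\sigma \\
&\le 2k\int_t^T \|(V_L^i(\sigma)-V_L^{i-1}(\sigma))\Phi(\sigma-t)x_0\|_{H_0^m(D)}\,d\sigma
\end{aligned}$$

since $\Phi^*(\sigma_1)$, $L(\sigma_1)$, $B(\sigma_1)$, and $R^{-1}(\sigma_1)$ are bounded operators.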


We can, in turn, write

$$\|(V_L^i(\sigma_1)-V_L^{i-1}(\sigma_1))\Phi(\sigma_1-t)x_0\|_{H_0^m(D)} \le 2k\int_{\sigma_1}^T \|(V_L^{i-1}(\sigma_2)-V_L^{i-2}(\sigma_2))\Phi(\sigma_1-t)\Phi(\sigma_2-\sigma_1)x_0\|_{H_0^m(D)}\,d\sigma_2$$

But, by the semigroup property (2.6.1),

$$\Phi(\sigma_1-t)\Phi(\sigma_2-\sigma_1) = \Phi(\sigma_2-t)$$

so that we may continue this iterative substitution to obtain

$$\begin{aligned}
\|(V_L^{i+1}(t)-V_L^i(t))x_0\|_{H_0^m(D)}
&\le 2^i k^i \int_t^T\!\!\int_{\sigma_1}^T\!\!\cdots\!\int_{\sigma_{i-1}}^T \|(V_L^1(\sigma_i)-V_L^0(\sigma_i))\Phi(\sigma_i-t)x_0\|_{H_0^m(D)}\,d\sigma_i\cdots d\sigma_2\,d\sigma_1 \\
&\le 2^i k^i \int_t^T\!\!\int_{\sigma_1}^T\!\!\cdots\!\int_{\sigma_{i-1}}^T\!\!\int_{\sigma_i}^T \big\|\Phi^*(\sigma_{i+1}-\sigma_i)\big[M(\sigma_{i+1})B(\sigma_{i+1})R^{-1}(\sigma_{i+1})B^*(\sigma_{i+1})L(\sigma_{i+1}) \\
&\qquad + L(\sigma_{i+1})B(\sigma_{i+1})R^{-1}(\sigma_{i+1})B^*(\sigma_{i+1})M(\sigma_{i+1})\big]\Phi(\sigma_{i+1}-\sigma_i)\Phi(\sigma_i-t)x_0\big\|\,d\sigma_{i+1}\cdots d\sigma_1 \\
&\le 2^{i+1} k^i \int_t^T\!\!\int_{\sigma_1}^T\!\!\cdots\!\int_{\sigma_{i-1}}^T\!\!\int_{\sigma_i}^T \|\Phi^*(\sigma_{i+1}-\sigma_i)\|\;\|M(\sigma_{i+1})B(\sigma_{i+1})R^{-1}(\sigma_{i+1})B^*(\sigma_{i+1})L(\sigma_{i+1})\| \\
&\qquad \times \|\Phi(\sigma_{i+1}-t)\|\;\|x_0\|\,d\sigma_{i+1}\cdots d\sigma_1
\end{aligned}$$

As before, it can be shown that $\Phi(t)$, $\Phi^*(t)$, $M(t)$, $B(t)$, $R^{-1}(t)$, and $L(t)$ are uniformly bounded on the finite interval $[0,T]$, so that

$$\begin{aligned}
\|(V_L^{i+1}(t) - V_L^i(t))x_0\|_{H_0^m(D)}
&\le 2^{i+1}k^i \int_t^T\!\!\int_{\sigma_1}^T\!\!\cdots\!\int_{\sigma_i}^T G\,\|x_0\|_{H_0^m(D)}\,d\sigma_{i+1}\cdots d\sigma_1 \\
&\le 2^{i+1}k^i G\,\|x_0\|_{H_0^m(D)} \int_t^T\!\!\int_{\sigma_1}^T\!\!\cdots\!\int_{\sigma_i}^T d\sigma_{i+1}\cdots d\sigma_1 \\
&= \frac{2^{i+1}k^i G\,(T-t)^{i+1}}{(i+1)!}\,\|x_0\|_{H_0^m(D)}
\end{aligned}$$


From this we determine the $L^2(0,T;H_0^m(D))$ norm to be

$$\|(V_L^{i+1}(\cdot)-V_L^i(\cdot))x_0\|^2_{L^2(0,T;H_0^m(D))} \le \frac{2^{i+1}k^i G\,T^{i+2}}{(i+2)!}\,\|x_0\|_{H_0^m(D)} = \frac{G}{2k^2}\,\frac{(2kT)^{i+2}}{(i+2)!}\,\|x_0\|_{H_0^m(D)}$$

From this we can see that the infinite series

$$\sum_{i=0}^{\infty}\|(V_L^{i+1}(\cdot)-V_L^i(\cdot))x_0\|^2_{L^2(0,T;H_0^m(D))}$$

converges and is less than or equal to $\frac{G}{2k^2}\big[e^{2kT}-(1+2kT)\big]\,\|x_0\|_{H_0^m(D)}$, implying that we have convergence of $\{V_L^i(\cdot)\}_{i=0}^{\infty}$ to the solution $V_L(\cdot)$ in the $L^2(0,T;H_0^m(D))$ sense. Thus, the existence of a solution to Eq. 4.6.6 is proved. Once again, as in the case of Lemma 4.1, the proof of uniqueness is a straightforward procedure and will be omitted. The self-adjointness of $V_L(t)$ can be deduced from the fact that its adjoint also satisfies Eq. 4.6.6, so that, by uniqueness, $V_L(t) = V_L^*(t)$.
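The bracketed factor $e^{2kT}-(1+2kT)$ in the convergence bound comes from summing the terms $(2kT)^{i+2}/(i+2)!$ over $i \ge 0$, which is the exponential series with its first two terms removed. The following is a minimal numerical sketch of that identity only; the function names and the sample values of $k$ and $T$ are illustrative assumptions, not part of the original text.

```python
import math

def series_partial_sum(k, T, terms=60):
    """Partial sum of sum_{i>=0} (2kT)^(i+2) / (i+2)! (illustrative helper)."""
    z = 2.0 * k * T
    term = z            # holds z**n / n!, starting at n = 1
    total = 0.0
    for n in range(2, terms + 2):
        term *= z / n   # now z**n / n!
        total += term
    return total

def series_closed_form(k, T):
    """Closed form e^{2kT} - (1 + 2kT) of the same series."""
    z = 2.0 * k * T
    return math.exp(z) - (1.0 + z)

if __name__ == "__main__":
    # Sample values chosen arbitrarily for the check.
    print(series_partial_sum(0.7, 1.5), series_closed_form(0.7, 1.5))
```

The two values agree to rounding error, which is all the convergence argument requires of the series.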

To prove the second part of the theorem, we examine the cost criterion for the parabolic control problem in the case where the desired state trajectory, $x_d(t)$, is zero. The cost criterion becomes

$$J = \langle Qx,x\rangle_{L^2(0,T;H_0^m(D))} + \langle Ru,u\rangle_{L^2(0,T;U)} + \langle Fx(T),x(T)\rangle_{H_0^m(D)} \tag{4.6.12}$$


If, in the identity

$$\langle V_L(0)x_0,x_0\rangle_{H_0^m(D)} - \langle V_L(T)x(T),x(T)\rangle_{H_0^m(D)} = -\int_0^T \big[\langle \dot V_L(t)x(t),x(t)\rangle_{H_0^m(D)} + \langle V_L(t)\dot x(t),x(t)\rangle_{H_0^m(D)} + \langle V_L(t)x(t),\dot x(t)\rangle_{H_0^m(D)}\big]\,dt$$

we replace $\dot V_L(t)$ and $V_L(T)$ by their specifications in Eq. 4.6.6 and use the value of $\dot x$ from the system equation, we obtain

$$\begin{aligned}
\langle V_L(0)x_0,x_0\rangle_{H_0^m(D)} - \langle Fx(T),x(T)\rangle_{H_0^m(D)}
={}& \int_0^T \langle Q(t)x(t),x(t)\rangle_{H_0^m(D)}\,dt \\
&+ \int_0^T \langle L(t)B(t)R^{-1}(t)B^*(t)L(t)x(t),x(t)\rangle_{H_0^m(D)}\,dt \\
&- \int_0^T \big\langle \big[V_L(t)B(t)R^{-1}(t)B^*(t)L(t)x(t) + L(t)B(t)R^{-1}(t)B^*(t)V_L(t)x(t) \\
&\qquad\quad + V_L(t)B(t)u(t)\big],\,x(t)\big\rangle_{H_0^m(D)}\,dt \\
&- \int_0^T \langle V_L(t)x(t),B(t)u(t)\rangle_{H_0^m(D)}\,dt
\end{aligned} \tag{4.6.13}$$

Now, using the fact that the control $u(t)$ satisfies Eq. 4.6.7, the second integral on the right-hand side of Eq. 4.6.13 can be identified as

$$\int_0^T \langle L(t)B(t)R^{-1}(t)B^*(t)L(t)x(t),x(t)\rangle_{H_0^m(D)}\,dt = \langle Ru,u\rangle_{L^2(0,T;U)}$$

and the last two integrals on the right-hand side of Eq. 4.6.13 sum to zero, so that

$$J = \langle Qx,x\rangle_{L^2(0,T;H_0^m(D))} + \langle Ru,u\rangle_{L^2(0,T;U)} + \langle Fx(T),x(T)\rangle_{H_0^m(D)} = \langle V_L(0)x_0,x_0\rangle_{H_0^m(D)}$$

proving the second part of the theorem. Since, from the above discussion, the system could have been started at any time $t\in[0,T]$ with initial state $x_0$, it must be true that

$$J = \langle Qx,x\rangle_{L^2(t,T;H_0^m(D))} + \langle Ru,u\rangle_{L^2(t,T;U)} + \langle Fx(T),x(T)\rangle_{H_0^m(D)} = \langle V_L(t)x_0,x_0\rangle_{H_0^m(D)}$$

and since $J \ge 0$ for all initial states $x_0\in D_0(A_3)$, it follows that

$$\langle V_L(t)x_0,x_0\rangle_{H_0^m(D)} \ge 0 \qquad \forall t\in[0,T],\ \forall x_0\in D_0(A_3)$$

demonstrating the positivity of the solution $V_L(t)$ and completing the proof of Theorem 4.9.


Corollary 4.1: If, in the parabolic control problem, we specify $L(t) = 0$, the solution of Eq. 4.6.6 becomes

$$V_L(t) = \Phi^*(T-t)F\Phi(T-t) + \int_t^T \Phi^*(\sigma-t)Q(\sigma)\Phi(\sigma-t)\,d\sigma \tag{4.6.14}$$

Proof: Equation 4.6.14 results from $L(t)$ being the zero operator in Eq. 4.6.8.

We shall now generate a sequence of positive, self-adjoint linear operator functions $\{V_n(t)\}_{n=0}^{\infty}$, $t\in[0,T]$, each with domain equal to $D_0(A_3)$. This sequence will be generated by the method of successive approximations in Eq. 4.6.6. The elements in the sequence will be shown to be bounded and monotonically decreasing in a sense which will be defined. It will then be shown that the sequence has a limit and that this limit is the solution of the Riccati operator equation.

The sequence $\{V_n(t)\}$ is given by

$$\begin{aligned}
V_0(t) &= 0 \\
\dot V_{n+1}(t) &= -V_{n+1}(t)\big[A - B(t)R^{-1}(t)B^*(t)V_n(t)\big] - \big[A^* - V_n(t)B(t)R^{-1}(t)B^*(t)\big]V_{n+1}(t) \\
&\quad - V_n(t)B(t)R^{-1}(t)B^*(t)V_n(t) - Q(t)\,;\qquad V_{n+1}(T) = F
\end{aligned} \tag{4.6.15}$$

It should be noted that Eq. 4.6.15 is equivalent to Eq. 4.6.6 with $V_L(t) = V_{n+1}(t)$ and $L(t) = V_n(t)$. One of the properties we would like to show that the sequence of operator functions possesses is that of monotonically decreasing positivity. We make this concept precise in the following definition:

Definition 4.4: If $P_1$ and $P_2$ are both positive linear operators, then $P_1$ is said to be greater than or equal to $P_2$, denoted $P_1 \ge P_2$, if the linear operator $(P_1 - P_2)$ is a positive operator.

We shall now state and prove a lemma which will be useful in the characterization of the sequence of operators:

Lemma 4.3: If $X_1$ and $X_2$ are self-adjoint linear operators, then

$$X_1BR^{-1}B^*X_1 = X_1BR^{-1}B^*X_2 + X_2BR^{-1}B^*X_1 - X_2BR^{-1}B^*X_2 + N \tag{4.6.16}$$

where $N$ is some positive operator.

Proof: Equation 4.6.16 follows from the fact that

$$(X_1 - X_2)BR^{-1}B^*(X_1 - X_2) \ge 0$$

which implies that

$$X_1BR^{-1}B^*X_1 \ge X_1BR^{-1}B^*X_2 + X_2BR^{-1}B^*X_1 - X_2BR^{-1}B^*X_2$$

and this implies that there must exist a positive operator $N$, namely $N = (X_1 - X_2)BR^{-1}B^*(X_1 - X_2)$, such that Eq. 4.6.16 holds.

The desired properties of the sequence $\{V_n(t)\}_{n=0}^{\infty}$ are now stated in the following theorem:

Theorem 4.10: The elements of $\{V_n(t)\}_{n=0}^{\infty}$ are bounded, and the sequence is monotonically decreasing for $n \ge 1$, in the sense that

$$V_1(t) \ge V_2(t) \ge V_3(t) \ge \cdots$$

Proof: (By induction) $V_1(t)$ is given by the expression in Eq. 4.6.14, which is clearly bounded and positive. According to the sequence generation formula (4.6.15):

$$\dot V_2(t) = -V_2(t)A_1(t) - A_1^*(t)V_2(t) - V_1(t)B(t)R^{-1}(t)B^*(t)V_1(t) - Q(t) \tag{4.6.17}$$

where

$$A_1(t) = A - B(t)R^{-1}(t)B^*(t)V_1(t)$$

The self-adjoint solution $V_2(t)$ exists and is positive by Theorem 4.9 (with $V_L(t) = V_2(t)$ and $L(t) = V_1(t)$). We may write the following differential equation for $\delta V_2(t) = V_1(t) - V_2(t)$:

$$\begin{aligned}
\delta\dot V_2 &= -(V_1(t)-V_2(t))A - A^*(V_1(t)-V_2(t)) - V_2(t)B(t)R^{-1}(t)B^*(t)V_1(t) \\
&\quad - V_1(t)B(t)R^{-1}(t)B^*(t)V_2(t) + V_1(t)B(t)R^{-1}(t)B^*(t)V_1(t) \\
&= -\delta V_2(t)\big[A - B(t)R^{-1}(t)B^*(t)V_1(t)\big] - \big[A^* - V_1(t)B(t)R^{-1}(t)B^*(t)\big]\delta V_2(t) \\
&\quad - V_1(t)B(t)R^{-1}(t)B^*(t)V_1(t)
\end{aligned} \tag{4.6.18}$$

with terminal condition $\delta V_2(T) = 0$, for which a positive solution $\delta V_2(t)$ for all $t\in[0,T]$ exists by Theorem 4.9 (with $V_L(t) = \delta V_2(t)$, $L(t) = V_1(t)$, $Q(t) = 0$, and $F = 0$). Since both $V_1(t)$ and $V_2(t)$ are positive this means that

$$V_1(t) \ge V_2(t) \qquad \forall t\in[0,T]$$

Moreover, since $V_1(t)$ is bounded, then $V_2(t)$ must also be bounded by the following argument:

Since $V_2(t)$ is positive and self-adjoint, we can apply the generalized Schwarz inequality (see Ref. 22, p. 262)

$$|\langle V_2(t)x,y\rangle|^2 \le \langle V_2(t)x,x\rangle\,\langle V_2(t)y,y\rangle$$

and obtain

$$\begin{aligned}
\|V_2(t)\| &= \sup_{\|x\|\ne 0,\ \|y\|\ne 0} \frac{|\langle V_2(t)x,y\rangle|}{\|x\|\,\|y\|}
\le \left[\sup_{\|x\|\ne 0}\frac{\langle V_2(t)x,x\rangle}{\|x\|^2}\ \sup_{\|y\|\ne 0}\frac{\langle V_2(t)y,y\rangle}{\|y\|^2}\right]^{1/2} \\
&\le \left[\sup_{\|x\|\ne 0}\frac{\langle V_1(t)x,x\rangle}{\|x\|^2}\ \sup_{\|y\|\ne 0}\frac{\langle V_1(t)y,y\rangle}{\|y\|^2}\right]^{1/2} \le \|V_1(t)\|
\end{aligned}$$

hence proving the boundedness of $V_2(t)$.

To continue the induction process we assume that $V_{n-1}(t)$ is bounded and $V_{n-1}(t) \ge V_n(t)$. We must prove that $V_n(t) \ge V_{n+1}(t)$. Now, $V_n(t)$ satisfies the equation

$$\dot V_n(t) = -V_n(t)A_{n-1}(t) - A_{n-1}^*(t)V_n(t) - V_{n-1}(t)B(t)R^{-1}(t)B^*(t)V_{n-1}(t) - Q(t)$$

with $A_{n-1}(t) = A - B(t)R^{-1}(t)B^*(t)V_{n-1}(t)$.

We may apply Lemma 4.3 to the next to last term of this equation to obtain the following equation for $V_n(t)$:

$$\begin{aligned}
\dot V_n(t) ={}& -V_n(t)A_{n-1}(t) - A_{n-1}^*(t)V_n(t) - Q(t) - V_{n-1}(t)B(t)R^{-1}(t)B^*(t)V_n(t) \\
& - V_n(t)B(t)R^{-1}(t)B^*(t)V_{n-1}(t) + V_n(t)B(t)R^{-1}(t)B^*(t)V_n(t) - N(t)
\end{aligned}$$

where $N(t)$ is some bounded, self-adjoint positive operator. By a rearrangement of terms similar to that done to obtain Eq. 4.6.18 this can be shown to be

$$\dot V_n(t) = -V_n(t)A_n(t) - A_n^*(t)V_n(t) - Q(t) - V_n(t)B(t)R^{-1}(t)B^*(t)V_n(t) - N(t)$$

with $A_n(t) = A - B(t)R^{-1}(t)B^*(t)V_n(t)$.

But, since a positive self-adjoint $V_{n+1}(t)$ exists by Theorem 4.9 and in addition satisfies the equation

$$\dot V_{n+1}(t) = -V_{n+1}(t)A_n(t) - A_n^*(t)V_{n+1}(t) - Q(t) - V_n(t)B(t)R^{-1}(t)B^*(t)V_n(t)$$

then the differential equation for the difference linear operator

$$\delta V_{n+1}(t) = V_n(t) - V_{n+1}(t)$$

is given by

$$\delta\dot V_{n+1}(t) = \dot V_n(t) - \dot V_{n+1}(t) = -\delta V_{n+1}(t)A_n(t) - A_n^*(t)\,\delta V_{n+1}(t) - N(t)\,;\qquad \delta V_{n+1}(T) = 0 \tag{4.6.19}$$

The proof that the solution of this equation is positive for all $t\in[0,T]$ is given in Appendix E, so that we know that

$$\delta V_{n+1}(t) \ge 0 \qquad \forall t\in[0,T]$$

which implies that

$$V_n(t) \ge V_{n+1}(t) \qquad \forall t\in[0,T]$$

and $V_{n+1}(t)$ is bounded on this interval by the same argument which proved $V_2(t)$ bounded, completing the proof of Theorem 4.10.
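The monotone behavior asserted in Theorem 4.10 can be observed in the simplest possible setting. The sketch below is an assumption-laden illustration, not the operator-valued construction of the text: it takes a scalar, time-invariant system $\dot x = ax + bu$ with $a < 0$, sets $s = b^2/r$, and replaces the differential equation of the successive-approximation scheme by its stationary form, $0 = -2V_{n+1}(a - sV_n) - sV_n^2 - q$, starting from $V_0 = 0$. The function names and numerical values are hypothetical.

```python
import math

def successive_approximations(a, s, q, n_iter=8):
    """Scalar, stationary analogue of the successive-approximation scheme.

    Each step solves the *linear* equation
        0 = -2 V_next (a - s V) - s V**2 - q
    for V_next, starting from V = 0 (requires a < 0 so the first step is defined).
    """
    v = 0.0
    seq = [v]
    for _ in range(n_iter):
        closed_loop = a - s * v            # scalar analogue of A - B R^{-1} B* V_n
        v = -(s * v * v + q) / (2.0 * closed_loop)
        seq.append(v)
    return seq

def riccati_root(a, s, q):
    """Positive root of the scalar algebraic Riccati equation s V**2 - 2 a V - q = 0."""
    return (a + math.sqrt(a * a + s * q)) / s

if __name__ == "__main__":
    for n, v in enumerate(successive_approximations(-1.0, 1.0, 1.0)):
        print(n, v)
    print("limit:", riccati_root(-1.0, 1.0, 1.0))
```

With $a=-1$, $s=q=1$ the iterates are $0,\ 0.5,\ 0.41667,\ 0.41422,\dots$: the sequence jumps up from $V_0 = 0$ to $V_1$ and then decreases toward the positive Riccati root $\sqrt{2}-1$, mirroring the statement that monotonicity holds for $n \ge 1$.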


It remains to achieve the stated aim of this section, that is, the demonstration of the existence of the optimal feedback operator $K(t)$. This will be done by showing that the sequence $\{V_n(t)\}_{n=0}^{\infty}$ has a limit and that this limit is precisely $K(t)$. We state this as a theorem:

Theorem 4.11: A self-adjoint, positive, bounded solution $V_\infty(t)$ of the Riccati operator equation (4.4.11) exists and is the limit (in $H_0^m(D)$) of the sequence of operators $\{V_n(t)\}_{n=0}^{\infty}$ defined by Eq. 4.6.15.


Proof: We shall show that $\lim_{n\to\infty} V_n(t)x$ exists for all $x\in H_0^m(D)$ and that the limit satisfies the Riccati equation (4.4.11). For any $x_0\in H_0^m(D)$, we have from Theorem 4.10 that

$$\langle V_1(t)x_0,x_0\rangle_{H_0^m(D)} \ge \langle V_2(t)x_0,x_0\rangle_{H_0^m(D)} \ge \cdots$$

Moreover, Theorem 4.10 also implies that each of the elements of the sequence $\{\langle V_n(t)x_0,x_0\rangle_{H_0^m(D)}\}_{n=1}^{\infty}$ is bounded below by 0. Since any monotonically decreasing sequence of real numbers which is bounded below has a limit, we know that $\lim_{n\to\infty}\langle V_n(t)x_0,x_0\rangle_{H_0^m(D)}$ exists for all $x_0\in H_0^m(D)$.

To show that the limit is of the form $\langle V_\infty(t)x_0,x_0\rangle$ we use a well-known theorem (see Ref. 29, p. 189) on linear operators in a general Hilbert space $H$, which states that a monotone decreasing sequence of positive, self-adjoint operators $\{V_n\}_{n=1}^{\infty}$ has a limit $V_\infty$ in the sense that

$$V_\infty x = \lim_{n\to\infty} V_n x \qquad \text{for all } x\in H$$

Using this result and Theorem 4.10, with the Hilbert space $H = H_0^m(D)$, we may conclude that

$$V_\infty(t)x = \lim_{n\to\infty} V_n(t)x \qquad \text{for all } x\in H_0^m(D) \text{ and } t\in[0,T]$$
Now that we have shown that $V_\infty(t)$ exists, we shall show that it satisfies the Riccati operator equation (4.4.11). Integrating both sides of Eq. 4.6.15 from $t$ to $T$ we obtain

$$\begin{aligned}
V_{n+1}(t) - F = \int_t^T \big\{ & V_{n+1}(\sigma)A - V_{n+1}(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)V_n(\sigma) + A^*V_{n+1}(\sigma) \\
& - V_n(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)V_{n+1}(\sigma) + V_n(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)V_n(\sigma) + Q(\sigma)\big\}\,d\sigma
\end{aligned}$$

Taking the limit as $n$ approaches infinity,

$$V_\infty(t) - F = \int_t^T \big\{V_\infty(\sigma)A - V_\infty(\sigma)B(\sigma)R^{-1}(\sigma)B^*(\sigma)V_\infty(\sigma) + A^*V_\infty(\sigma) + Q(\sigma)\big\}\,d\sigma \tag{4.6.20}$$

Equation 4.6.20 shows that $V_\infty(t)$ is continuous in $t$ and differentiable, so that by differentiating (4.6.20), we obtain

$$\dot V_\infty(t) = -V_\infty(t)A - A^*V_\infty(t) + V_\infty(t)B(t)R^{-1}(t)B^*(t)V_\infty(t) - Q(t)\,;\qquad V_\infty(T) = F$$

showing that $V_\infty(t)$ satisfies the Riccati operator equation. $V_\infty(t)$ is clearly self-adjoint and can be shown to be bounded by application of the generalized Schwarz inequality in precisely the same fashion as was done in the proof of Theorem 4.10.

We have thus satisfied the hypotheses of Theorem 4.8 and can identify the operator $V_\infty(t)$ with the optimal feedback operator $K(t)$.

It should be noted that although the results appear to be the same in both the case where the system operator is coercive and the case where the system operator is the infinitesimal generator of a semigroup of operators, there is a difference. In the coercive case the results hold for all initial states in the state space $H_0^m(D)$, whereas, in the latter case, the results hold only for initial states in the domain of $A_3$. This is not as restrictive as it might seem, since the domain of $A_3$ is dense in $H_0^m(D)$, and, thus, in the latter case, $K(t)x$ can be defined for $x\in H_0^m(D)$ and $x\notin D_0(A_3)$ by letting

$$K(t)x = \lim_{n\to\infty} K(t)x_n$$

where $\{x_n\}_{n=1}^{\infty}$ is a sequence of points in $D_0(A_3)$ converging to $x$. Since $K(t)$ has been shown to be bounded, the sequence $\{K(t)x_n\}_{n=1}^{\infty}$ has a limit.

With the matter of existence of solutions to the Riccati operator equation resolved, let us briefly consider the problem of actually solving the Riccati operator equation. Since the space $H_0^m(D)$ is a separable Hilbert space, there exists a basis $\{\phi_j\}_{j=1}^{\infty}$, where each $\phi_j$ in the basis is an element of $H_0^m(D)$, such that any element $x\in H_0^m(D)$ has the unique representation

$$x = \sum_{j=1}^{\infty} x_j\phi_j$$

where the coefficients $x_j$ are given by

$$x_j = \langle x,\phi_j\rangle_{H_0^m(D)}\,,\qquad j = 1,2,\ldots$$

Thus, we may consider an element $x\in H_0^m(D)$ to be alternatively represented by the infinite dimensional vector $\mathbf{x}$, with $j$-th component $x_j$. If $L$ is any linear operator from $H_0^m(D)$ into $H_0^m(D)$, we have for $x\in H_0^m(D)$,

$$Lx = L\sum_{j=1}^{\infty} x_j\phi_j = \sum_{j=1}^{\infty} x_j L\phi_j$$

Now, $L\phi_j$ is an element of $H_0^m(D)$ so that

$$L\phi_j = \sum_{i=1}^{\infty} L_{ij}\phi_i\,,\qquad L_{ij} = \langle L\phi_j,\phi_i\rangle_{H_0^m(D)}\,,\qquad i = 1,2,\ldots\,;\ j = 1,2,\ldots$$

Thus, $Lx$ may be represented by $\mathbf{L}\mathbf{x}$, where $\mathbf{L}$ is the infinite matrix with $ij$-th element $L_{ij}$. Similarly, since $U$ is a separable Hilbert space, with basis $\{\psi_j\}_{j=1}^{\infty}$, each element $u\in U$ may be considered to be an infinite dimensional vector $\mathbf{u}$, and the control operator $B(t)$ may be considered to be the infinite matrix $\mathbf{B}(t)$ with $ij$-th element

$$B_{ij}(t) = \langle B(t)\psi_j,\phi_i\rangle_{H_0^m(D)}$$

Let us, for the purpose of illustration, assume that $R(t) = I$, the identity operator. We may now rewrite the Riccati operator equation as the infinite dimensional matrix Riccati equation

$$\dot{\mathbf{K}}(t) = -\mathbf{K}(t)\mathbf{A} - \mathbf{A}'\mathbf{K}(t) + \mathbf{K}(t)\mathbf{B}(t)\mathbf{B}'(t)\mathbf{K}(t) - \mathbf{Q}(t)\,;\qquad \mathbf{K}(T) = \mathbf{F}$$

where all of the matrices in this expression are uniquely determined in the fashion prescribed above.

As an example of this procedure let us consider the scalar heat equation (see Section 2.4), with coefficient $p = 1$, on the domain $D = (0,1)$,

$$\frac{\partial x(t,z)}{\partial t} = \frac{\partial^2 x(t,z)}{\partial z^2} + u(t,z)\,;\qquad x(0,z) = x_0(z)$$

with boundary conditions

$$x(t,0) = x(t,1) = 0$$

Let us choose the cost functional to be

$$J = \int_0^T \big[\|x(t)\|^2_{H_0^2(D)} + \|u(t)\|^2_{H_0^2(D)}\big]\,dt + \|x(T)\|^2_{H_0^2(D)}$$

which corresponds to choosing the operators $Q(t)$, $R(t)$, and $F$ to be the identity operator on $H_0^2(D)$, which has the infinite matrix representation $\mathbf{I} = \mathrm{diag}(1, 1, 1, \ldots)$.

The Sobolev space under consideration is $H_0^2(D)$, and we shall choose the countable basis

$$\left\{\phi_n(z) = \sqrt{\frac{2}{1+n^2\pi^2+n^4\pi^4}}\,\sin n\pi z\right\}_{n=1}^{\infty}$$

It is easily seen that this is an orthonormal set in the $H_0^2(D)$ norm. Using this basis, the operator $A = \partial^2/\partial z^2$ has the matrix representation $\mathbf{A}$, the $ij$-th element of which is given by

$$A_{ij} = \left\langle \frac{\partial^2\phi_j(z)}{\partial z^2},\,\phi_i(z)\right\rangle_{H_0^2(D)} = \langle -j^2\pi^2\phi_j(z),\,\phi_i(z)\rangle_{H_0^2(D)} = -i^2\pi^2\delta_{ij}$$

where $\delta_{ij}$ is the Kronecker delta. Thus $\mathbf{A}$ is the infinite diagonal matrix

$$\mathbf{A} = \mathrm{diag}(-\pi^2,\ -4\pi^2,\ -9\pi^2,\ \ldots)$$

and the system partial differential equation may be written in the form

$$\dot{\mathbf{x}}(t) = \mathbf{A}\mathbf{x}(t) + \mathbf{u}(t)$$

and the Riccati operator differential equation may be written as

$$\dot{\mathbf{K}}(t) = -\mathbf{K}(t)\,\mathrm{diag}(-\pi^2, -4\pi^2, -9\pi^2, \ldots) - \mathrm{diag}(-\pi^2, -4\pi^2, -9\pi^2, \ldots)\,\mathbf{K}(t) + \mathbf{K}(t)\mathbf{K}(t) - \mathbf{I}\,;\qquad \mathbf{K}(T) = \mathbf{I}$$
It is possible to truncate these matrices and solve the resulting finite dimensional matrix differential equation for an approximate value of the $\mathbf{K}(t)$ matrix. Approximations of this type will be discussed, in a slightly different context, when we consider modal analytic solutions in Chapter V. An alternative way of determining optimal feedback solutions will be presented in Section 4.8.
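To make the truncation concrete, here is a minimal sketch for the heat-equation example, under the assumptions that $B$, $Q$, $R$, and $F$ are all the identity and that $\mathbf{A} = \mathrm{diag}(-j^2\pi^2)$. Under those assumptions every matrix is diagonal, so the truncated matrix Riccati equation decouples into scalar equations $\dot k_j = 2j^2\pi^2 k_j + k_j^2 - 1$, $k_j(T) = 1$, one per retained mode. The integrator (classical fourth-order Runge-Kutta in reversed time), the function names, and the step count are illustrative choices, not taken from the text.

```python
import math

def truncated_riccati_diagonal(n_modes=3, T=1.0, steps=4000):
    """Return [k_1(0), ..., k_n(0)] for the decoupled modal Riccati equations.

    Each mode satisfies k' = 2*a*k + k**2 - 1 with k(T) = 1, a = (j*pi)**2.
    We integrate in reversed time tau = T - t, where dk/dtau = -(2*a*k + k**2 - 1).
    """
    h = T / steps
    gains = []
    for j in range(1, n_modes + 1):
        a = (j * math.pi) ** 2

        def rhs(k, a=a):
            return -2.0 * a * k - k * k + 1.0

        k = 1.0  # terminal condition K(T) = I
        for _ in range(steps):
            k1 = rhs(k)
            k2 = rhs(k + 0.5 * h * k1)
            k3 = rhs(k + 0.5 * h * k2)
            k4 = rhs(k + h * k3)
            k += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        gains.append(k)
    return gains

def stationary_gain(j):
    """Positive root of k**2 + 2*a*k - 1 = 0 with a = (j*pi)**2."""
    a = (j * math.pi) ** 2
    return -a + math.sqrt(a * a + 1.0)

if __name__ == "__main__":
    for j, k0 in enumerate(truncated_riccati_diagonal(), start=1):
        print(j, k0, stationary_gain(j))
```

Because the modes decay quickly, the gains at $t = 0$ are already very close to the stationary values $-j^2\pi^2 + \sqrt{j^4\pi^4 + 1}$ for $T = 1$, and the higher modes contribute progressively smaller gains, which is what makes truncation plausible in this example.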
4.7 INFINITE TIME SOLUTIONS

In the parabolic control problems considered in the preceding sections, it has been assumed that the terminal time $T$ is finite. In this section we examine the behavior of optimal solutions when $T$ approaches infinity. It will be shown that an optimal solution and a solution of the Riccati operator equation on $[0,\infty)$ exist in the case where the system operator is coercive. These results are due to Lions.$^{15}$ Moreover, it will be shown that, under the assumption of complete controllability, a solution of the Riccati operator equation exists on $[0,\infty)$ in the case where the system operator is the infinitesimal generator of a semigroup of operators. For both types of system operators it will be shown that the Riccati operator equation becomes an algebraic time-invariant operator equation when the operators $B(t)$, $Q(t)$, and $R(t)$ are assumed to be time-invariant.

Let us assume that the operators $B(t)$, $Q(t)$, and $R(t)$ are uniformly bounded on $[0,\infty)$. Moreover, let us also assume that there is no terminal-time weighting cost, i.e., $F = 0$. Existence and uniqueness of the optimal control on the infinite time interval are guaranteed in the case of coercive system operators, since H(u, v) in Theorem 4.1 is still a symmetric, coercive bilinear form continuous in $u$ and $v$ even though $u$ and $v$ are controls defined on the infinite interval $[0,\infty)$. Theorem 4.2 still holds and tells us that $u(t) = -R^{-1}(t)B^*p(t)$ for all $t\in[0,\infty)$. It remains to show that the costate is well defined on the infinite interval, and that it can be written as

$$p(t) = K(t)x(t) + g(t)\,,\qquad \forall t\in[0,\infty)$$

Lions$^{15}$ shows that, if the desired state $x_d(\cdot)\in L^2(0,\infty;H_0^m(D))$, then the costate equation

$$\frac{dp}{dt} = -A^*p(t) - Q(t)[x(t) - x_d(t)]$$

has a unique solution $p_\infty(\cdot)\in W(0,\infty)$. If $s$ is any fixed time in $[0,\infty)$ and $x(s) = h\in H_0^m(D)$, the argument in Section 4.4 can be duplicated to show that the transformation $h\to p(s)$ is continuous from $H_0^m(D)$ into $H_0^m(D)$, so that we can write

$$p_\infty(s) = K_\infty(s)h + g_\infty(s)$$

or, by using the fact that $s\in[0,\infty)$ is arbitrary, we conclude that

$$p_\infty(t) = K_\infty(t)x_\infty(t) + g_\infty(t)\,,\qquad \forall t\in[0,\infty)$$

where $x_\infty(t)$ is the solution of the system equation on $[0,\infty)$. The remaining arguments of Section 4.4 are extended without difficulty to show that $K_\infty(t)$ satisfies the Riccati operator equation

$$\dot K_\infty(t) = -K_\infty(t)A - A^*K_\infty(t) + K_\infty(t)B(t)R^{-1}(t)B^*(t)K_\infty(t) - Q(t) \tag{4.7.1}$$

and $g_\infty(\cdot)$ is an element of $W(0,\infty)$ which satisfies

$$\dot g_\infty(t) = -A^*g_\infty(t) + K_\infty(t)B(t)R^{-1}(t)B^*(t)g_\infty(t) + Q(t)x_d(t) \tag{4.7.2}$$

and the minimum cost using the optimal control $u_\infty^*(t) = -R^{-1}(t)B^*(t)[K_\infty(t)x(t) + g_\infty(t)]$† on the interval $[s,\infty)$ is given by

$$J = \langle K_\infty(s)x_\infty(s),x_\infty(s)\rangle_{H_0^m(D)} + 2\langle g_\infty(s),x_\infty(s)\rangle_{H_0^m(D)} \tag{4.7.3}$$

† Note that since $x_\infty(\cdot)\in W(0,\infty)$ and $g_\infty(\cdot)\in W(0,\infty)$, then $\lim_{t\to\infty} u_\infty^*(t) = 0$.

Under the assumption that $B(t) = B$, $Q(t) = Q$, and $R(t) = R$, Lions shows that the transformation $h\to p(s)$ is independent of $s$, so that we may write

$$p(s) = K_\infty h + g_\infty(s)$$

where $K_\infty$ is the solution of the time-invariant algebraic operator equation

$$K_\infty A + A^*K_\infty - K_\infty BR^{-1}B^*K_\infty + Q = 0 \tag{4.7.4}$$

and $g_\infty(t)$ is the solution of the time-invariant differential equation

$$\dot g_\infty(t) = -A^*g_\infty(t) + K_\infty BR^{-1}B^*g_\infty(t) + Qx_d(t)$$

So it is seen that the important point of continuity of the transformation of the initial conditions to the costate carries through in a straightforward manner and enables us to obtain results for the infinite terminal time case similar to those obtained in Section 4.4 for the finite terminal time case. Once again, this continuous transformation cannot be defined in the case where the system operator is the infinitesimal generator of a semigroup of operators. Moreover, the infinite time version of Theorem 4.3, namely, the fact that the mapping $u(\cdot)\to x(\cdot)$ of $L^2(0,\infty;U)$ into $L^2(0,\infty;H_0^m(D))$ is continuous, cannot be proved, since inequalities in the proof do not hold on the infinite time interval. This results in an inability to use Theorem 4.5 to prove existence of a unique optimal control in this case. In order to prove existence of an optimal control we must make use of the concept of complete controllability and proceed by limiting arguments to the characterization of the optimal control.

Let us make the following definition:†

Definition 4.5: The parabolic system

$$\dot x = Ax + Bu\,;\qquad x(t) = x_0\in H_0^m(D)$$

is said to be completely controllable if for every $t\in[0,\infty)$ and for every $x_0\in H_0^m(D)$, there exists a time $t_1$ and a control $u_0(\cdot)$ defined on $[t,t_1]$ such that the state at $t_1$, $x(t_1) = 0$.

† This is just the application of the standard definition of complete controllability (see Ref. 24, p. 200) to parabolic systems.

Another way to interpret this definition is that if the state at any time $t_1$ can be represented by a linear transformation on the control $u(\cdot)$ defined on $[t,t_1]$, namely,

$$x(t_1) = L_{t_1}u + \phi(t_1)$$

then we would like to find a control $u_0$ and a time $t_1$ such that

$$L_{t_1}u_0 = -\phi(t_1)$$

If $\phi(t_1)$ is arbitrary, this requires that the range of $L_{t_1}$, denoted $R(L_{t_1})$, be all of $H_0^m(D)$ for complete controllability. But, since $R(L_{t_1}L_{t_1}^*) = R(L_{t_1})$, we require $R(L_{t_1}L_{t_1}^*) = H_0^m(D)$, or, in other words, we require $L_{t_1}L_{t_1}^*$ to be invertible. Clearly, in the case under consideration,

$$L_{t_1}u = \int_t^{t_1}\Phi(t_1-\sigma)B(\sigma)u(\sigma)\,d\sigma \qquad \forall u\in L^2(t,t_1;U)$$

and

$$(L_{t_1}^*z)(t) = B^*(t)\Phi^*(t_1-t)z \qquad \forall z\in H_0^m(D)$$

so that the operator $L_{t_1}L_{t_1}^*$ is

$$L_{t_1}L_{t_1}^* = \int_t^{t_1}\Phi(t_1-\sigma)B(\sigma)B^*(\sigma)\Phi^*(t_1-\sigma)\,d\sigma \tag{4.7.6}$$

Thus, complete controllability is equivalent to finding a time $t_1$ for which $L_{t_1}L_{t_1}^*$ in (4.7.6) is invertible.
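The role of the operator in (4.7.6) is easiest to see in one dimension. The sketch below is an illustration under explicit assumptions, not the operator-valued argument of the text: it takes a scalar system $\dot x = ax + bu$, for which the semigroup is $e^{a\tau}$ and the analogue of (4.7.6) is the number $W = \int_t^{t_1} e^{2a(t_1-\sigma)}b^2\,d\sigma$, and applies the standard finite-dimensional minimum-energy control $u_0(\tau) = -b\,e^{a(t_1-\tau)}W^{-1}e^{a(t_1-t)}x$. All names and numerical values are hypothetical.

```python
import math

def gramian(a, b, t0, t1):
    """Scalar analogue of (4.7.6): integral of e^{2a(t1-s)} b^2 over [t0, t1]."""
    if a == 0.0:
        return b * b * (t1 - t0)
    return b * b * math.expm1(2.0 * a * (t1 - t0)) / (2.0 * a)

def steer_to_zero(a, b, x0, t0, t1, steps=2000):
    """Simulate x' = a x + b u0(tau) with the minimum-energy control; return x(t1)."""
    w = gramian(a, b, t0, t1)
    free_response = math.exp(a * (t1 - t0)) * x0   # state at t1 with u = 0

    def u0(tau):
        # Control that cancels the free response at time t1.
        return -b * math.exp(a * (t1 - tau)) * free_response / w

    def rhs(tau, x):
        return a * x + b * u0(tau)

    h = (t1 - t0) / steps
    x, tau = x0, t0
    for _ in range(steps):  # classical fourth-order Runge-Kutta
        k1 = rhs(tau, x)
        k2 = rhs(tau + 0.5 * h, x + 0.5 * h * k1)
        k3 = rhs(tau + 0.5 * h, x + 0.5 * h * k2)
        k4 = rhs(tau + h, x + h * k3)
        x += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        tau += h
    return x

if __name__ == "__main__":
    print(gramian(-1.0, 1.0, 0.0, 1.0), steer_to_zero(-1.0, 1.0, 2.0, 0.0, 1.0))
```

The simulated terminal state is zero up to integration error, and the construction fails exactly when $W = 0$ (for instance $b = 0$), which is the scalar counterpart of the invertibility requirement.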



With the introduction of the concept of controllability we shall now be able to deal with the problem of proving the existence of an optimal control and the convergence of the feedback operator when the terminal time approaches infinity. Toward this purpose, let us denote $K(t,T;F)$ to be the optimal feedback operator for the control problem with cost criterion

$$J = \langle Qx,x\rangle_{L^2(t,T;H_0^m(D))} + \langle Ru,u\rangle_{L^2(t,T;U)} + \langle Fx(T),x(T)\rangle_{H_0^m(D)}$$

Let us also denote $J(x,t,T,u(\cdot))$ to be the cost of starting at time $t$ with $x(t) = x$ and applying the control $u(\cdot)$ on $[t,T]$. Now, if the system is controllable, then there exists a time $t_1$ such that the control

$$u_0(\tau) = -B^*(\tau)\Phi^*(\tau-t)(L_{t_1}L_{t_1}^*)^{-1}x\,;\qquad \tau\in[t,t_1] \tag{4.7.7}$$

results in the desired transfer to 0. This is easily seen by using Eqs. 4.7.6 and 4.7.7 in the parabolic system equation with initial condition $x(t) = x$. Moreover, by using Eqs. 4.7.6 and 4.7.7, it can also be shown that

$$\int_t^{t_1}\langle u_0(\tau),u_0(\tau)\rangle_U\,d\tau = \langle (L_{t_1}L_{t_1}^*)^{-1}x,x\rangle_{H_0^m(D)}$$

and, since $R(t)$ is uniformly bounded with $\|R(t)\| \le r$,

$$\int_t^{t_1}\langle R(\tau)u_0(\tau),u_0(\tau)\rangle_U\,d\tau \le r\,\langle (L_{t_1}L_{t_1}^*)^{-1}x,x\rangle_{H_0^m(D)}$$

Thus, we can obtain an upper bound on the cost due to application of control $u_0(\cdot)$ on $[t,t_1]$. Similarly, we can obtain an upper bound for the trajectory cost using control $u_0(\cdot)$ by showing that

$$\int_t^{t_1}\langle Q(\tau)x(\tau),x(\tau)\rangle_{H_0^m(D)}\,d\tau = \langle c(t,t_1)x,x\rangle_{H_0^m(D)}$$

where

$$c(t,t_1) = \int_t^{t_1}\big[I - (L_\tau L_\tau^*)(L_{t_1}L_{t_1}^*)^{-1}\big]^*\,\Phi^*(\tau-t)Q(\tau)\Phi(\tau-t)\,\big[I - (L_\tau L_\tau^*)(L_{t_1}L_{t_1}^*)^{-1}\big]\,d\tau$$

The precise form of this expression is messy, but the important fact to note is that since $t_1$ is finite, $c(t,t_1)$ is positive and bounded. We can now find a bound on $K(t,T;0)$, $T > t_1$, which is independent of $T$; namely, by application of the control $\bar u_0(\cdot)$ on $[t,T]$, where

$$\bar u_0(\tau) = \begin{cases} u_0(\tau)\,, & t\le\tau\le t_1 \\ 0\,, & \tau > t_1 \end{cases}$$

we obtain

$$J(x,t,T,\bar u_0(\cdot)) = J(x,t,t_1,u_0(\cdot)) \le \big\langle \big[c(t,t_1) + r(L_{t_1}L_{t_1}^*)^{-1}\big]x,x\big\rangle_{H_0^m(D)}$$

But, since

$$\langle K(t,T;0)x,x\rangle_{H_0^m(D)} = \min_{u(\cdot)\in L^2(t,T;U)} J(x,t,T,u(\cdot))$$

we obtain

$$K(t,T;0) \le c(t,t_1) + r(L_{t_1}L_{t_1}^*)^{-1}$$

noting that the bound is independent of $T$.

It is now a fairly straightforward matter to prove that the infinite time solution exists and that

$$\lim_{T\to\infty} K(t,T;0) = K_\infty(t)$$

where $K_\infty(t)$ is the solution of the Riccati equation on the infinite interval. Indeed, the arguments used are exactly parallel to those used by Kleinman in the case of finite dimensional systems, so that they will not be stated in great detail.

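If $T_2 > T_1$ we have, by the principle of optimality, that

$$\begin{aligned}
\langle K(t,T_2;0)x,x\rangle_{H_0^m(D)} &= \min_{u(\cdot)\in L^2(t,T_2;U)} J(x,t,T_2,u(\cdot)) \\
&= \min_{u(\cdot)\in L^2(t,T_2;U)} \Big[J(x,t,T_1,u(\cdot)) + \int_{T_1}^{T_2}\langle Q(\tau)x(\tau),x(\tau)\rangle_{H_0^m(D)}\,d\tau + \langle Ru,u\rangle_{L^2(T_1,T_2;U)}\Big] \\
&\ge \langle K(t,T_1;0)x,x\rangle_{H_0^m(D)} + \min_{u(\cdot)\in L^2(T_1,T_2;U)} \Big[\int_{T_1}^{T_2}\langle Q(\tau)x(\tau),x(\tau)\rangle\,d\tau + \langle Ru,u\rangle_{L^2(T_1,T_2;U)}\Big]
\end{aligned}$$

Since the second term on the right-hand side of the inequality is positive, it must be true that

$$K(t,T_1;0) \le K(t,T_2;0)$$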
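The two facts established so far, monotonicity in the terminal time and a bound independent of it, can be observed numerically in a scalar setting. The following sketch is only an illustration under stated assumptions: it uses a scalar time-invariant system with $s = b^2/r$, integrates the scalar Riccati equation $\dot k = -2ak + sk^2 - q$ backward from $k(T) = 0$ with a classical Runge-Kutta scheme, and compares the values $k(0)$ for increasing $T$ with the positive root of the algebraic equation. Function names and parameter values are hypothetical.

```python
import math

def riccati_gain_at_zero(a, s, q, T, steps=4000):
    """k(0) for k' = -2 a k + s k^2 - q with k(T) = 0 (no terminal weighting).

    Integrated in reversed time tau = T - t, where dk/dtau = 2 a k - s k^2 + q.
    """
    def rhs(k):
        return 2.0 * a * k - s * k * k + q

    h = T / steps
    k = 0.0
    for _ in range(steps):
        k1 = rhs(k)
        k2 = rhs(k + 0.5 * h * k1)
        k3 = rhs(k + 0.5 * h * k2)
        k4 = rhs(k + h * k3)
        k += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return k

def algebraic_gain(a, s, q):
    """Positive root of 2 a k - s k^2 + q = 0."""
    return (a + math.sqrt(a * a + s * q)) / s

if __name__ == "__main__":
    for T in (0.5, 1.0, 2.0, 4.0):
        print(T, riccati_gain_at_zero(-1.0, 1.0, 1.0, T))
    print("bound:", algebraic_gain(-1.0, 1.0, 1.0))
```

The printed gains increase with $T$ and stay below the algebraic root, which is the scalar version of the limiting argument that follows.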

If we form a sequence of terminal times $\{T_i\}_{i=1}^{\infty}$ with $T_{i+1} > T_i$ and $\lim_{i\to\infty} T_i = \infty$, we know that the sequence $\{\langle K(t,T_i;0)x,x\rangle_{H_0^m(D)}\}_{i=1}^{\infty}$ is monotonically increasing. However, we have also shown that this sequence is bounded, independent of $T_i$, by $\langle[c(t,t_1) + r(L_{t_1}L_{t_1}^*)^{-1}]x,x\rangle$, which implies that the sequence converges for any fixed $x$ and $t$. By arguments similar to those used in the proof of Theorem 4.11 in the preceding section, we can write this limit as $\langle K(t,\infty;0)x,x\rangle_{H_0^m(D)}$.

Let us denote $K(t,\infty;0)$ as, simply, $K_\infty(t)$. $K_\infty(t)$ can be shown to be the solution of the Riccati operator equation on $[t,\infty)$ by proving that for all $t_a\in[t,\infty)$, $K_\infty(t) = K(t,t_a;K_\infty(t_a))$, which is the solution of the Riccati equation on $[0,t_a]$ with terminal-time weighting operator $K_\infty(t_a)$. Using the fact that the solution of this equation is continuous in the terminal condition $K_\infty(t_a)$ we obtain, for $t_a < t_b$,

$$\begin{aligned}
K_\infty(t) &= \lim_{t_b\to\infty} K(t,t_b;0) = \lim_{t_b\to\infty} K(t,t_a;K(t_a,t_b;0)) \\
&= K\big(t,t_a;\lim_{t_b\to\infty}K(t_a,t_b;0)\big) = K(t,t_a;K_\infty(t_a))
\end{aligned}$$

The proof that the optimal control, $u^*(\cdot)\in L^2(t,\infty;U)$, for the infinite terminal time parabolic control problem with $x_d = 0$ is given by $u^*(\tau) = -R^{-1}(\tau)B^*(\tau)K_\infty(\tau)x(\tau)$ for all $\tau\in[t,\infty)$ and that the minimum cost is $J = \langle K_\infty(t)x,x\rangle_{H_0^m(D)}$ is exactly the same as that given by Kleinman (Theorem 5) for the finite dimensional case, so it will be omitted.

The demonstration of the fact that if $B(t) = B$, $Q(t) = Q$, and $R(t) = R$, then $K_\infty(t) = K_\infty$, satisfying the algebraic operator equation (4.7.4), is precisely the same as that used by Lions in the coercive system operator case.

The above results were obtained under the assumption that the desired state trajectory $x_d(t) = 0$. Let us now assume that this is not the case and $x_d(t)$ is the solution of the equation

$$\dot x_d(t) = Gx_d(t)\,;\qquad x_d(0) = x_{d0} \tag{4.7.8}$$

where $G$ is a linear spatial differential operator which satisfies the conditions of Section 2.7 and is, therefore, the infinitesimal generator of a semigroup of operators, and $x_{d0}$ is some arbitrary element of $H_0^m(D)$. If we now consider the error function

$$e(t) = x(t) - x_d(t)$$

the cost functional (3.3.1), with $F = 0$, can be written

$$J = \langle Qe,e\rangle_{L^2(0,T;H_0^m(D))} + \langle Ru,u\rangle_{L^2(0,T;U)} \tag{4.7.9}$$
where $T$ is finite. We now state the following lemma:

Lemma 4.4: The control which minimizes the cost functional (4.7.9) is of the form

$$u^*(t) = -R^{-1}B^*[K(t)x(t) - S(t)x_d(t)] \tag{4.7.10}$$

and the minimum cost function is given by

$$J(x,x_d,t) = \langle K(t)x,x\rangle_{H_0^m(D)} - 2\langle S(t)x_d,x\rangle_{H_0^m(D)} + \langle P(t)x_d,x_d\rangle_{H_0^m(D)} \tag{4.7.11}$$

where $K(t)$ is the solution of the Riccati operator equation (4.4.11) with $K(T) = 0$, $S(t)$ is the solution of the operator equation

$$\dot S(t) = -A^*S(t) - S(t)G + K(t)BR^{-1}B^*S(t) - Q \tag{4.7.12}$$

with $S(T) = 0$, and $P(t)$ is the solution of the operator equation

$$\dot P(t) = -G^*P(t) - P(t)G + S(t)BR^{-1}B^*S(t) - Q \tag{4.7.13}$$

with $P(T) = 0$.

Proof: Using the identity

$$\begin{aligned}
\int_0^T \frac{d}{dt}\big[&\langle K(t)x(t),x(t)\rangle_{H_0^m(D)} - 2\langle S(t)x_d(t),x(t)\rangle_{H_0^m(D)} + \langle P(t)x_d(t),x_d(t)\rangle_{H_0^m(D)}\big]\,dt \\
&= -\big[\langle K(0)x(0),x(0)\rangle_{H_0^m(D)} - 2\langle S(0)x_d(0),x(0)\rangle_{H_0^m(D)} + \langle P(0)x_d(0),x_d(0)\rangle_{H_0^m(D)}\big]
\end{aligned}$$

and following exactly the same procedure that was used in the proof of Theorem 4.8, the following expression is obtained for the cost:

$$\begin{aligned}
J ={}& \langle K(0)x(0),x(0)\rangle_{H_0^m(D)} - 2\langle S(0)x_d(0),x(0)\rangle_{H_0^m(D)} + \langle P(0)x_d(0),x_d(0)\rangle_{H_0^m(D)} \\
&+ \int_0^T \big\langle R\big[R^{-1}B^*(K(t)x(t) - S(t)x_d(t)) + u(t)\big],\ \big[R^{-1}B^*(K(t)x(t) - S(t)x_d(t)) + u(t)\big]\big\rangle_U\,dt
\end{aligned} \tag{4.7.14}$$

Since the last term in Eq. 4.7.14 is nonnegative, the cost is minimized if and only if this last term is 0, and this is achieved if and only if the optimal control is given by Eq. 4.7.10. Moreover, the minimum value of the cost, starting at time $t$ with $x(t) = x$ and $x_d(t) = x_d$, is given by Eq. 4.7.11.

Let us consider the special case where the controlled system has exactly the same dynamics as the system which is "tracked," that is, let us suppose $G = A$. Eq. 4.7.12 becomes

$$\dot S(t) = -A^*S(t) - S(t)A + K(t)BR^{-1}B^*S(t) - Q \tag{4.7.15}$$

It can be shown, by use of Theorem 4.9, that Eq. 4.7.15 has a unique, positive, self-adjoint solution. Since $S(t) = K(t)$ satisfies this equation, it must be the unique solution. Similarly, it can be shown that the solution of (4.7.13), in this case, is $P(t) = K(t)$, so that the optimal control, from Eq. 4.7.10, is given by

$$u^*(t) = -R^{-1}B^*K(t)[x(t) - x_d(t)] = -R^{-1}B^*K(t)e(t)$$

and the minimum cost, from Eq. 4.7.11, is given by

$$J(x,x_d,t) = \langle K(t)[x(t) - x_d(t)],[x(t) - x_d(t)]\rangle_{H_0^m(D)} = \langle K(t)e(t),e(t)\rangle_{H_0^m(D)} = J(e,t)$$

These results are intuitively satisfying in that, when we assume that $G = A$, the error $e(t)$ satisfies the same dynamical equation as the state $x(t)$, so that minimization of the cost functional (4.7.9) should yield precisely the same equations, in terms of $e(t)$, for the optimal control and minimum cost function as were obtained, in terms of $x(t)$, for the optimal control problem with $x_d(t) = 0$.

Let us consider the behavior of the optimal solution when the terminal time $T$ is infinite and the operators $G$ and $A$ are unequal. We have shown that the feedback operator is $K_\infty$, the bounded solution of the time-invariant algebraic operator equation (4.7.4). This implies that $S(t)$ is the solution of the time-invariant operator differential equation

$$\dot S(t) = -[A^* - K_\infty BR^{-1}B^*]S(t) - S(t)G - Q \tag{4.7.16}$$

As a preliminary to writing a solution to this equation, let us take note of the fact (see Ref. 21, p. 389) that if an operator $L$ is the infinitesimal generator of a semigroup of operators, and if $N$ is a bounded operator, then the operator $L + N$, defined on the domain of $L$, is the infinitesimal generator of a semigroup of operators. Thus, the operator $A^* - K_\infty BR^{-1}B^*$ is the infinitesimal generator of a semigroup, which we shall denote $\{\Phi_1(t)\}_{t\in[0,\infty)}$. We have already assumed that $G$ is the infinitesimal generator of a semigroup of operators, which we denote $\{\Psi(t)\}_{t\in[0,\infty)}$. By virtue of Lemma 4.2, we can write the solution of Eq. 4.7.16 on the infinite time interval as

$$S(t) = \int_t^{\infty}\Phi_1(\sigma-t)\,Q\,\Psi(\sigma-t)\,d\sigma \tag{4.7.17}$$

and the solution of Eq. 4.7.13 as

$$P(t) = \int_t^{\infty}\Psi^*(\sigma-t)\big[Q - S(\sigma)BR^{-1}B^*S(\sigma)\big]\Psi(\sigma-t)\,d\sigma \tag{4.7.18}$$

It might reasonably be asked at this point whether time-invariant operator solutions $S_\infty$ and $P_\infty$ exist to Eqs. 4.7.16 and 4.7.13, respectively. A time-invariant operator solution $S_\infty$ of Eq. 4.7.16 must satisfy the algebraic operator equation

$$[A^* - K_\infty BR^{-1}B^*]S_\infty + S_\infty G = -Q \tag{4.7.19}$$

If we consider the finite dimensional version of this equation, namely the matrix equation

$$AX + XB = C \tag{4.7.20}$$

where $A$, $B$, $C$, and $X$ are real $n\times n$ matrices, we may make use of a well-known result (see Ref. 30, p. 231) and conclude that a necessary and sufficient condition for Eq. 4.7.20 to have a unique solution $X$ for every $C$ is that $\lambda_i + \mu_j \neq 0$ for all $i,j = 1,2,\ldots,n$, where $\lambda_i$ and $\mu_j$ are the $i$th and $j$th eigenvalues of $A$ and $B$, respectively.
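The finite dimensional condition can be checked numerically. The following is a minimal sketch, not taken from the source: it assumes NumPy, uses symmetric negative definite test matrices chosen only for illustration, and the helper name `solve_sylvester_kron` is hypothetical. It rewrites $AX+XB=C$ as a linear system through the Kronecker product and compares the eigenvalues of the resulting matrix with the pairwise sums $\lambda_i+\mu_j$.

```python
import numpy as np

def solve_sylvester_kron(A, B, C):
    """Solve A X + X B = C by rewriting it as a linear system in vec(X).

    Hypothetical helper for illustration. With row-major stacking,
    vec(A X) = (A kron I) vec(X) and vec(X B) = (I kron B^T) vec(X).
    """
    n = A.shape[0]
    I = np.eye(n)
    M = np.kron(A, I) + np.kron(I, B.T)
    x = np.linalg.solve(M, C.reshape(-1))
    return x.reshape(n, n), M

rng = np.random.default_rng(0)
Ga = rng.standard_normal((3, 3))
Gb = rng.standard_normal((3, 3))
# Assumed test data: symmetric negative definite A and B (eigenvalues <= -1).
A = -(Ga @ Ga.T) - np.eye(3)
B = -(Gb @ Gb.T) - np.eye(3)
C = rng.standard_normal((3, 3))

X, M = solve_sylvester_kron(A, B, C)
residual = np.linalg.norm(A @ X + X @ B - C)

# The eigenvalues of the Kronecker sum are the pairwise sums lambda_i + mu_j.
lam = np.linalg.eigvalsh(A)
mu = np.linalg.eigvalsh(B)
pair_sums = np.sort((lam[:, None] + mu[None, :]).reshape(-1))
kron_eigs = np.linalg.eigvalsh(M)          # returned in ascending order
min_abs_sum = np.min(np.abs(pair_sums))    # nonzero, so the system is solvable
```

Because every sum $\lambda_i+\mu_j$ is nonzero in this example, the Kronecker matrix is invertible and the solution is unique.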

Let us try to generalize this result so as to obtain a condition for the existence of a solution to Eq. 4.7.19. If $L$ is any (bounded or unbounded) linear operator defined on $H_0^m(D)$, then, since $H_0^m(D)$ is a subspace of $L^2(D)$ and therefore has a basis which is a subset of an orthonormal basis $\{e_i\}_{i=1}^\infty$ of $L^2(D)$, we may define the infinite matrix $\mathbf{L}$ with $ij$th element

$$L_{ij} = \langle Le_i, e_j\rangle_{L^2(D)}$$

Of course, the matrix $(\lambda\mathbf{I} - \mathbf{L})$ will fail to have an inverse if and only if $\lambda$ is an element of the spectrum of $L$. Using this procedure, Eq. 4.7.19 can be represented by an infinite matrix equation. If we let the operators $(A^* - K_\infty BR^{-1}B^*)$, $S_\infty$, $G$, and $Q$ be represented by the infinite matrices $\mathbf{A}$, $\mathbf{S}_\infty$, $\mathbf{G}$, and $\mathbf{Q}$, respectively, then we may rewrite Eq. 4.7.19 as

$$\mathbf{A}\mathbf{S}_\infty + \mathbf{S}_\infty\mathbf{G} = -\mathbf{Q} \tag{4.7.21}$$

for which we may state the following existence lemma:

Lemma 4.5: Equation 4.7.21 has a solution if and only if 0 is not an element of $\sigma(A^* - K_\infty BR^{-1}B^*) \oplus \sigma(G)$,* where $\sigma(L)$ denotes the spectrum of the operator $L$.

Proof: We shall prove this lemma by generalizing the concept of Kronecker products (Ref. 30, p. 227) to infinite dimensional matrices. By this means we may write the matrix equation (4.7.21) as the equation

$$(\mathbf{A}\times\mathbf{I} + \mathbf{I}\times\mathbf{G}')\,\mathbf{s}_\infty = -\mathbf{q} \tag{4.7.22}$$

where $\mathbf{I}$ is the infinite dimensional identity matrix, $\times$ denotes the Kronecker product, and $\mathbf{s}_\infty$ and $\mathbf{q}$ are countably infinite vectors composed of the elements of $\mathbf{S}_\infty$ and $\mathbf{Q}$, respectively. The manner in which the elements of $\mathbf{s}_\infty$ and $\mathbf{q}$ are chosen is as follows: if $\mathbf{X}$ is an infinite dimensional matrix, we successively choose the elements of the vector $\mathbf{x}$ by following the indicated path through the array.

It is easily seen that the spectrum of the matrix $(\mathbf{A}\times\mathbf{I} + \mathbf{I}\times\mathbf{G}')$ is the direct sum of the spectrum of $\mathbf{A}$ and the spectrum of $\mathbf{G}$, or

$$\sigma(\mathbf{A}\times\mathbf{I} + \mathbf{I}\times\mathbf{G}') = \sigma(A^* - K_\infty BR^{-1}B^*) \oplus \sigma(G)$$

which implies that Eq. 4.7.22 has a vector solution $\mathbf{s}_\infty$ if and only if 0 is not an element of this direct sum. Existence of a vector solution $\mathbf{s}_\infty$ of Eq. 4.7.22 is, of course, equivalent to the existence of a matrix solution $\mathbf{S}_\infty$ of Eq. 4.7.21.

* The symbol $\oplus$ denotes direct sum, that is, if $H_1$ and $H_2$ are two subsets of a vector space $V$, then $H_1 \oplus H_2 = \{x\in V : x = x_1 + x_2 \text{ for some } x_1\in H_1 \text{ and } x_2\in H_2\}$.

We are now in a position to state and prove the following lemma concerning the existence of a bounded solution to Eq. 4.7.19:

Lemma 4.6: If $A^* - K_\infty BR^{-1}B^*$ and $G$ are strongly elliptic operators, as described in Section 2.3, then Eq. 4.7.19 has a bounded operator solution $S_\infty$.

Proof: The strong ellipticity of $A^* - K_\infty BR^{-1}B^*$ and $G$ implies that the spectra of $A^* - K_\infty BR^{-1}B^*$ and $G$ are contained in the left half of the complex plane and do not include the origin of the complex plane. This implies that the direct sum $\sigma(A^* - K_\infty BR^{-1}B^*) \oplus \sigma(G)$ is contained in the left-half plane and 0 is not an element of the direct sum. Thus, by Lemma 4.5, matrix equation (4.7.21) has a solution $\mathbf{S}_\infty$, and the corresponding operator $S_\infty$ is a solution of the operator equation (4.7.19). The boundedness of $S_\infty$ is a consequence of the fact that

$$S_\infty = \lim_{t\to\infty} S(t)$$

where $S(t)$ is the solution of the operator differential equation (4.7.16) given explicitly by expression (4.7.17). We can show that $S(t)$ is uniformly bounded for $t\in[0,\infty)$ by using the fact that $A^* - K_\infty BR^{-1}B^*$ and $G$ are strongly elliptic and, thus, are infinitesimal generators of semigroups of operators $\{\Phi_1(t)\}_{t\in[0,\infty)}$ and $\{\Phi_2(t)\}_{t\in[0,\infty)}$, respectively, which have the property that*

$$\|\Phi_1(t)\| \le M_1 e^{-\lambda_1 t}$$

and

$$\|\Phi_2(t)\| \le M_2 e^{-\lambda_2 t}$$

where $M_1$, $M_2$, $\lambda_1$, and $\lambda_2$ are positive constants. Using Eq. 4.7.17 we may now write

$$\|S(t)\| \le \int_t^\infty \|\Phi_1(\sigma-t)\|\,\|Q\|\,\|\Phi_2(\sigma-t)\|\,d\sigma \le M_1M_2\|Q\|\,e^{(\lambda_1+\lambda_2)t}\int_t^\infty e^{-(\lambda_1+\lambda_2)\sigma}\,d\sigma = \frac{M_1M_2}{\lambda_1+\lambda_2}\,\|Q\|$$

which implies that $S_\infty$ is bounded.

* See Section 2.7.
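A finite dimensional analogue of this bound can be checked directly. The sketch below is an illustration under stated assumptions, not the construction used in the text: the matrices `A1` and `A2` are symmetric negative definite stand-ins for $A^* - K_\infty BR^{-1}B^*$ and $G$, so that $\|e^{A_1 t}\| = e^{-\lambda_1 t}$ and $\|e^{A_2 t}\| = e^{-\lambda_2 t}$ with $M_1=M_2=1$. The integral $\int_0^\infty e^{A_1 s} Q e^{A_2 s}\,ds$ is evaluated in closed form in the eigenbases.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 4
G1 = rng.standard_normal((n, n))
G2 = rng.standard_normal((n, n))
A1 = -(G1 @ G1.T) - 0.5 * np.eye(n)   # assumed stand-in for A* - K_inf B R^{-1} B*
A2 = -(G2 @ G2.T) - 0.3 * np.eye(n)   # assumed stand-in for G
Q = rng.standard_normal((n, n))

# Eigen-decompositions A1 = U1 diag(d1) U1^T and A2 = U2 diag(d2) U2^T.
d1, U1 = np.linalg.eigh(A1)
d2, U2 = np.linalg.eigh(A2)
lam1 = -np.max(d1)   # decay rate of exp(A1 t)
lam2 = -np.max(d2)   # decay rate of exp(A2 t)

# In the eigenbases, int_0^inf exp(d1_i s) q_ij exp(d2_j s) ds = q_ij / (-(d1_i + d2_j)).
Qt = U1.T @ Q @ U2
S = U1 @ (Qt / (-(d1[:, None] + d2[None, :]))) @ U2.T

# S should satisfy the algebraic equation A1 S + S A2 = -Q.
residual = np.linalg.norm(A1 @ S + S @ A2 + Q)

norm_S = np.linalg.norm(S, 2)
bound = np.linalg.norm(Q, 2) / (lam1 + lam2)   # M1 = M2 = 1 for symmetric matrices
```

In this setting `norm_S` never exceeds `bound`, which mirrors the estimate $M_1M_2\|Q\|/(\lambda_1+\lambda_2)$.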

The existence of a time-invariant operator solution $P_\infty$ to Eq. 4.7.13 hinges upon the existence of a solution to the algebraic operator equation

$$G^*P_\infty + P_\infty G = S_\infty BR^{-1}B^*S_\infty - Q \tag{4.7.23}$$

It is quite clear that under the assumptions of Lemma 4.6, a bounded solution $P_\infty$ exists.

We shall return to the discussion of infinite terminal-time problems in Section 5.3, where the case of pointwise control will be considered.


4.8 Derivation of the Riccati Integro-differential Equation

In the preceding sections of this chapter we have shown that a bounded, positive, self-adjoint, optimal feedback operator exists (1) in the case where the system operator is coercive, and (2) with the additional assumption of complete controllability, in the case where the system operator is the infinitesimal generator of a semigroup of operators. The optimal feedback operator $K(t)$ is the solution of the Riccati operator equation. Unfortunately, there are no straightforward procedures for solving operator equations directly. It is the purpose of this section to derive an equation from the Riccati operator equation which can be "solved" analytically or numerically. This will be achieved by showing that $K(t)$ can be represented by an integral operator. An integro-differential equation will then be derived for the kernel of this integral operator and an expression for the optimal cost will be specified. Under the assumption that the state-weighting cost term of the cost functional is of the form $\langle Qx, x\rangle_{L^2(0,T;L^2(D))}$, where $Q(t)$ is a bounded linear operator from $L^2(D)$ into $L^2(D)$, we shall prove that the optimal feedback operator $K(t)$ is a bounded linear operator from $L^2(D)$ into $L^2(D)$. This will enable us to specify a particularly simple form for the optimal cost function.

As an introduction to the concept of representing bounded linear operators by integral operators, let us examine $I_{C_0^\infty(D)}$, the identity operator on the space of infinitely differentiable functions with compact support in $D$, which space is discussed in Chapter II, Section 2. This operator can be represented by the following integral operator:

$$I_{C_0^\infty(D)}\,\phi = \phi(z) = \int_D \delta(z-\xi)\,\phi(\xi)\,d\xi, \quad \forall\,\phi\in C_0^\infty(D)$$

The important thing to note from this is that the kernel of the integral operator, the Dirac delta function, is a distribution on $D\times D$. Indeed, Laurent Schwartz (see Ref. 31, Theorem 1) proves that any distribution on $D\times D$ is the kernel of a continuous linear operator from $C_0^\infty(D)$ into $\mathcal{D}'(D)$, the space of distributions on $D$ discussed in Section 2.2. As a matter of notation let us denote the kernel by $L(z,\xi)$ and the corresponding continuous linear operator by $L$. Since we are interested in the possible representation of the feedback operator $K(t)$ by an integral operator, we are naturally more interested in the converse of this statement. Schwartz (Ref. 31, Theorem 2) proves that the converse is true, namely, that every continuous linear operator $L$ from $C_0^\infty(D)$ into $\mathcal{D}'(D)$ can be represented by a unique integral operator whose kernel, $L(z,\xi)$, is a distribution on $D\times D$.

Thus, having seen that the integral operator representation holds for bounded linear operators on $C_0^\infty(D)$, we must determine when this representation holds for bounded linear operators from $H_0^m(D)$ into $H_0^m(D)$. Once again, Schwartz provides the answer in the so-called Schwartz Kernel theorem:

Theorem 4.12: If $H_1$ and $H_2$ are locally convex spaces and $L$ is a continuous linear operator from $H_1$ into $H_2$, and if the following are true:

1. $C_0^\infty(D) \subset H_i \subset \mathcal{D}'(D)$; $i=1,2$

2. $C_0^\infty(D)$ is dense in $H_1 \cap H_2$

then $L$ can be represented by a unique integral operator whose kernel $L(z,\xi)$ is a distribution on $D\times D$.

The proof of this theorem follows from the fact that since $C_0^\infty(D) \subset H_1$ and $H_2 \subset \mathcal{D}'(D)$, the restriction of $L$ to $C_0^\infty(D)$ is a continuous linear operator from $C_0^\infty(D)$ into $\mathcal{D}'(D)$ and can be represented by an integral operator, which, from the fact that $C_0^\infty(D)$ is dense in $H_1 \cap H_2$, can be extended to $H_1$ by the continuity of the operator $L$.


It is not very difficult to see that the optimal linear feedback operator $K(t)$ satisfies the hypotheses of Theorem 4.12 for all $t\in[0,T]$. For any $t$, $K(t)$ is a bounded linear operator from the Hilbert space $H_0^m(D)$ into itself, implying that $K(t)$ is continuous, since boundedness of an operator on a Hilbert space is equivalent to continuity. The space $H_0^m(D)$ clearly satisfies condition 1 of Theorem 4.12 and, since both $C_0^\infty(D)$ and $H_0^m(D)$ are dense in $L^2(D)$, $C_0^\infty(D)$ is dense in $H_0^m(D)$, satisfying condition 2. Thus, by Theorem 4.12 there exists a kernel $K(t,z,\xi)$ such that

$$K(t)x = \int_D K(t,z,\xi)\,x(\xi)\,d\xi \quad \forall\,x\in C_0^\infty(D) \tag{4.8.1}$$

and if $x\in H_0^m(D)$ but $x\notin C_0^\infty(D)$

$$K(t)x = \lim_{n\to\infty}\int_D K(t,z,\xi)\,x_n(\xi)\,d\xi \tag{4.8.2}$$

where $\{x_n\}_{n=1}^\infty$ is a sequence in $C_0^\infty(D)$ convergent to $x$.

To simplify notation in the sequel let us assume Eq. 4.8.1 holds for all $x\in H_0^m(D)$ with the tacit assumption that Eq. 4.8.2 truly represents $K(t)$ if $x\notin C_0^\infty(D)$. It can also be shown that if $K(t)$ is continuously differentiable with respect to $t$, then $K(t,z,\xi)$ is continuously differentiable with respect to $t$, and

$$\dot{K}(t)x = \int_D \frac{\partial K}{\partial t}(t,z,\xi)\,x(\xi)\,d\xi \tag{4.8.3}$$

It will also be necessary in the sequel to consider the operator

$$L(t) = B(t)R^{-1}(t)B^*(t)$$

which is also a bounded linear operator from $H_0^m(D)$ into itself and therefore can be represented by

$$L(t)x = \int_D L(t,z,\xi)\,x(\xi)\,d\xi \quad \forall\,x\in H_0^m(D) \tag{4.8.4}$$

Likewise, the state weighting operator $Q(t)$ and the terminal state weighting operator $F$ can be represented as

$$Q(t)x = \int_D Q(t,z,\xi)\,x(\xi)\,d\xi \quad \forall\,x\in H_0^m(D) \tag{4.8.5}$$

and

$$Fx = \int_D F(z,\xi)\,x(\xi)\,d\xi$$

We are now prepared to apply these results of the kernel theorem to the Riccati equation

$$\dot{K}(t)x = -K(t)Ax - A^*K(t)x + K(t)L(t)K(t)x - Q(t)x\ ;\quad K(T)x = Fx \tag{4.8.6}$$

where $x$ is an arbitrary element of $H_0^m(D)$. Let us first note that the term $A^*K(t)x$ has the representation

$$A^*K(t)x = A^*\int_D K(t,z,\xi)\,x(\xi)\,d\xi = \int_D A_z^*K(t,z,\xi)\,x(\xi)\,d\xi \tag{4.8.7}$$

where the subscript $z$ in the right-hand equality denotes the fact that $A^*$ is a differential operator in the $z$ spatial variable, operating on $K(t,z,\xi)$. In the case of the term $K(t)Ax$, we have

$$K(t)Ax = \int_D K(t,z,\xi)\,A_\xi x(\xi)\,d\xi$$

where the subscript $\xi$ refers to spatial differentiation in terms of $\xi$. But, for fixed $z$ and $t$, we can look upon $K(t,z,\xi)$ as an element of $\mathcal{D}'(D)$, the distributions on $D$, so that by elementary properties of distributions, the integral can be rewritten

$$K(t)Ax = \int_D K(t,z,\xi)\,A_\xi x(\xi)\,d\xi = \int_D A_\xi^*K(t,z,\xi)\,x(\xi)\,d\xi \tag{4.8.8}$$

i.e., the kernel of the operator $K(t)A$ is $A_\xi^*K(t,z,\xi)$.

Using Eqs. 4.8.1 and 4.8.4 we can write the following:

$$K(t)L(t)K(t)x = \int_D K(t,z,\rho)\int_D L(t,\rho,\sigma)\int_D K(t,\sigma,\xi)\,x(\xi)\,d\xi\,d\sigma\,d\rho$$

$$= \int_D\int_D\int_D K(t,z,\rho)\,L(t,\rho,\sigma)\,K(t,\sigma,\xi)\,x(\xi)\,d\xi\,d\sigma\,d\rho \tag{4.8.9}$$

Now, using Eqs. 4.8.3, 4.8.5, 4.8.7, 4.8.8 and 4.8.9 in the Riccati equation (4.8.6), we obtain

$$\int_D \frac{\partial K}{\partial t}(t,z,\xi)\,x(\xi)\,d\xi = -\int_D A_z^*K(t,z,\xi)\,x(\xi)\,d\xi - \int_D A_\xi^*K(t,z,\xi)\,x(\xi)\,d\xi$$

$$\qquad + \int_D\int_D\int_D K(t,z,\rho)\,L(t,\rho,\sigma)\,K(t,\sigma,\xi)\,x(\xi)\,d\xi\,d\sigma\,d\rho - \int_D Q(t,z,\xi)\,x(\xi)\,d\xi\ ;$$

$$\int_D K(T,z,\xi)\,x(\xi)\,d\xi = \int_D F(z,\xi)\,x(\xi)\,d\xi$$

Since this equation must be true for all $x\in H_0^m(D)$, it must be true that

$$\frac{\partial K}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K(t,z,\xi) + \int_D\int_D K(t,z,\rho)\,L(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho - Q(t,z,\xi) \tag{4.8.10}$$

and $K(T,z,\xi) = F(z,\xi)$.

Thus, we have derived an integro-differential equation of the Riccati type. It is quite clear that the kernel $K(t,z,\xi)$ is symmetric in its spatial arguments, that is,

$$K(t,z,\xi) = K(t,\xi,z)$$

This is a direct consequence of the fact that $K(t)$ is self-adjoint (see Sections 4.4 and 4.6) for all $t\in[0,T]$.

Boundary conditions may be specified for the Riccati integro-differential equation (4.8.10). If $z\in\partial D$, the boundary of $D$, then by the transformation in Eq. 4.8.1 we obtain

$$y(t,z)\Big|_{z\in\partial D} = \int_D K(t,z,\xi)\Big|_{z\in\partial D}\,x(t,\xi)\,d\xi$$

the evaluation of a function $y(t)$ in $H_0^m(D)$ at the point $z\in\partial D$. But this is zero by the definition of $H_0^m(D)$, which implies that $K(t,z,\xi)\big|_{z\in\partial D} = 0$ where $\xi$ is any element in $D$. In a like manner, it can be shown that all of the Dirichlet boundary conditions hold for $K(t,z,\xi)$, that is, referring to Section 2.5,

$$K(t,z,\xi)\Big|_{z\in\partial D} = \frac{\partial K(t,z,\xi)}{\partial n}\Big|_{z\in\partial D} = \cdots = \frac{\partial^{m-1}K(t,z,\xi)}{\partial n^{m-1}}\Big|_{z\in\partial D} = 0$$

where $n$ is the normal to the boundary $\partial D$. Moreover, by the symmetry of the kernel, the above boundary conditions must also hold for $\xi\in\partial D$ and $z$ any element of $D$, that is

$$K(t,z,\xi)\Big|_{\xi\in\partial D} = \frac{\partial K(t,z,\xi)}{\partial n}\Big|_{\xi\in\partial D} = \cdots = \frac{\partial^{m-1}K(t,z,\xi)}{\partial n^{m-1}}\Big|_{\xi\in\partial D} = 0$$

Now that the Riccati operator equation (4.4.11) has been transformed into an integro-differential equation, we may specify Eq. 4.4.12, the equation for $g(t)$, as an integro-differential equation:

$$\frac{\partial g(t,z)}{\partial t} = -A_z^*\,g(t,z) + \int_D\int_D K(t,z,\xi)\,L(t,\xi,\sigma)\,g(t,\sigma)\,d\sigma\,d\xi + \int_D Q(t,z,\xi)\,x_d(t,\xi)\,d\xi$$

$$g(T,z) = -\int_D F(z,\xi)\,x_d(T,\xi)\,d\xi$$

Since $g(t)$ is an element of $H_0^m(D)$, the Dirichlet boundary conditions must again be satisfied.

Recall that the minimum value of the cost is given by the expression

$$J = \langle K(0)x(0),x(0)\rangle_{H_0^m(D)} + 2\langle g(0),x(0)\rangle_{H_0^m(D)} + \phi(0) \tag{4.8.11}$$

where $\phi(t)$ is the solution of Eq. 4.4.14. Using the integral operator representation for $K(t)$, we may evaluate the inner products in expression (4.8.11) according to the definition in Chapter II of inner product on the Sobolev space $H_0^m(D)$, and, thus, we obtain*

$$J = \int_D \sum_{|q|\le m}\Big(D^q\!\int_D K(0,z,\xi)\,x(0,\xi)\,d\xi\Big)\big(D^q x(0,z)\big)\,dz + 2\int_D \sum_{|q|\le m} D^q g(0,z)\,D^q x(0,z)\,dz + \phi(0) \tag{4.8.12}$$

where $\phi(t)$ satisfies the ordinary differential equation

$$\dot{\phi}(t) = -\int_D \sum_{|q|\le m}\Big(D^q\!\int_D Q(t,z,\xi)\,x_d(t,\xi)\,d\xi\Big)\big(D^q x_d(t,z)\big)\,dz + \int_D \sum_{|q|\le m} D^q g(t,z)\,D^q\!\int_D L(t,z,\xi)\,g(t,\xi)\,d\xi\,dz \tag{4.8.13}$$

$$\phi(T) = \int_D \sum_{|q|\le m}\Big(D^q\!\int_D F(z,\xi)\,x_d(T,\xi)\,d\xi\Big)\big(D^q x_d(T,z)\big)\,dz$$

* The notation $D^q$ is described in Section 2.2.

The above expressions for the cost terms are extremely complicated. This is not surprising, however, since our state-weighting cost term $\langle Q(x-x_d),x-x_d\rangle_{L^2(0,T;H_0^m(D))}$ may be written

$$\langle Q(x-x_d),x-x_d\rangle_{L^2(0,T;H_0^m(D))} = \int_0^T\!\!\int_D \sum_{|q|\le m}\Big(D^q\!\int_D Q(t,z,\xi)\big(x(t,\xi)-x_d(t,\xi)\big)\,d\xi\Big)\,D^q\big(x(t,z)-x_d(t,z)\big)\,dz\,dt$$

Using elementary properties of distributions this expression may be rewritten as

$$\langle Q(x-x_d),x-x_d\rangle_{L^2(0,T;H_0^m(D))} = \int_0^T\!\!\int_D\!\int_D Q(t,z,\xi)\sum_{|q|\le m}(-1)^{|q|}\,D^q\big(x(t,\xi)-x_d(t,\xi)\big)\,D^q\big(x(t,z)-x_d(t,z)\big)\,dz\,d\xi\,dt$$

which indicates that we are, in actuality, weighting all spatial derivatives (up to order $m$) of the state in the quadratic cost functional.

Now (assuming, for simplicity, that $x_d(t)=0$), it might reasonably be asked: can we have a state-weighting term in the cost functional of the form

$$\langle Qx,x\rangle_{L^2(0,T;L^2(D))} = \int_0^T\!\!\int_D\!\int_D Q(t,z,\xi)\,x(t,z)\,x(t,\xi)\,dz\,d\xi\,dt \tag{4.8.14}$$

where the operator $Q(t)$ is now a bounded linear operator from $L^2(D)$ into $L^2(D)$, and will this result in the existence of an optimal feedback operator? Although the expression (4.8.14) has been written in the form of an inner product on $L^2(D)$, we are, in actuality, restricting $x$ to be in the subset $H_0^m(D)$ of $L^2(D)$ in order that the system equation be satisfied. Accordingly, the inner product in $L^2(D)$ which is represented by expression (4.8.14) must be of the form

$$\int_0^T \langle Q(t)\mathcal{A}x(t),\mathcal{A}x(t)\rangle_{L^2(D)}\,dt = \int_0^T\!\!\int_D\!\int_D Q(t,z,\xi)\,x(t,z)\,x(t,\xi)\,d\xi\,dz\,dt \tag{4.8.15}$$

where $\mathcal{A}$ is a bounded linear operator from $H_0^m(D)$ into $L^2(D)$. The left-hand side of Eq. 4.8.15 may be written as

$$\int_0^T \langle Q(t)\mathcal{A}x(t),\mathcal{A}x(t)\rangle_{L^2(D)}\,dt = \int_0^T \langle \mathcal{A}^*Q(t)\mathcal{A}x(t),x(t)\rangle_{H_0^m(D)}\,dt \tag{4.8.16}$$

where $\mathcal{A}^*$ is the adjoint of $\mathcal{A}$. The operator $\mathcal{A}^*Q(t)\mathcal{A}$ is a bounded linear operator from $H_0^m(D)$ into $H_0^m(D)$, and, therefore, by Theorem 4.12, has the integral operator representation

$$\mathcal{A}^*Q(t)\mathcal{A}x(t) = \int_D Q_1(t,z,\xi)\,x(t,\xi)\,d\xi \tag{4.8.17}$$

for some kernel distribution $Q_1(t,z,\xi)$. Combining Eqs. 4.8.15 and 4.8.16 we obtain the relation

$$\langle \mathcal{A}^*Q(t)\mathcal{A}x(t),x(t)\rangle_{H_0^m(D)} = \int_D\!\int_D Q(t,z,\xi)\,x(t,z)\,x(t,\xi)\,dz\,d\xi \tag{4.8.18}$$

and, using Eq. 4.8.17, the left-hand side of Eq. 4.8.18 may be expressed as

$$\langle \mathcal{A}^*Q(t)\mathcal{A}x(t),x(t)\rangle_{H_0^m(D)} = \int_D \sum_{|q|\le m} D^q\!\int_D Q_1(t,z,\xi)\,x(t,\xi)\,d\xi\ D^q x(t,z)\,dz$$

$$= \int_D\!\int_D \Big(\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,Q_1(t,z,\xi)\Big)\,x(t,z)\,x(t,\xi)\,dz\,d\xi \tag{4.8.19}$$

where the operators $D_z^q$ and $D_\xi^q$ are the operator $D^q$ in the $z$ and $\xi$ spatial variables, respectively. The last equality in (4.8.19) follows from elementary results (see Ref. 17, pp. 323-337) in distribution theory. Thus, from Eqs. 4.8.18 and 4.8.19 it is seen that the kernel $Q_1(t,z,\xi)$ must satisfy the partial differential equation

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,Q_1(t,z,\xi) = Q(t,z,\xi) \tag{4.8.20}$$

Now, since $Q_1(t,z,\xi)$ is the kernel of a bounded linear operator from $H_0^m(D)$ into $H_0^m(D)$, Theorem 4.11 implies that the Riccati operator equation (4.4.11), with $Q(t)$ taken to be $\mathcal{A}^*Q(t)\mathcal{A}$, has a bounded, positive, self-adjoint solution $K_1(t)$, which, by Theorem 4.12, may be represented by an integral operator with kernel $K_1(t,z,\xi)$. Moreover, this kernel must satisfy Eq. 4.8.10, namely

$$\frac{\partial K_1}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K_1(t,z,\xi) + \int_D\!\int_D K_1(t,z,\rho)\,L(t,\rho,\sigma)\,K_1(t,\sigma,\xi)\,d\sigma\,d\rho - Q_1(t,z,\xi) \tag{4.8.21}$$

Note that the double integral term in Eq. 4.8.21 is in the form of an inner product on $L^2(D)$, so that we may use the same reasoning which led to Eq. 4.8.20 to state that there exists a kernel $L_1(t,z,\xi)$, corresponding to a bounded operator $L_1(t)$ from $H_0^m(D)$ into $H_0^m(D)$, such that

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,L_1(t,z,\xi) = L(t,z,\xi) \tag{4.8.22}$$
Let us now perform this type of operation on the solution $K_1(t,z,\xi)$ of Eq. 4.8.21, that is, let

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,K_1(t,z,\xi) = K(t,z,\xi) \tag{4.8.23}$$

It is clear that $K(t,z,\xi)$ is the kernel of a bounded linear operator $K(t)$ from $L^2(D)$ into $L^2(D)$. We shall now proceed to determine the equation which $K(t,z,\xi)$ must satisfy. Using the operator $\sum_{|q|\le m}(-1)^{|q|}D_z^q D_\xi^q$ on each term of Eq. 4.8.21, we see, first, that

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,\frac{\partial K_1}{\partial t}(t,z,\xi) = \frac{\partial K}{\partial t}(t,z,\xi)$$

Next, if we assume that $A_z^*$ is a constant coefficient differential operator, we obtain

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,A_z^*K_1(t,z,\xi) = A_z^*\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\,K_1(t,z,\xi) = A_z^*K(t,z,\xi)$$

The same result holds for the term containing $A_\xi^*$. Using Eq. 4.8.22 and elementary properties of distributions, we obtain

$$\sum_{|q|\le m}(-1)^{|q|}\,D_z^q D_\xi^q\int_D\!\int_D K_1(t,z,\rho)\,L(t,\rho,\sigma)\,K_1(t,\sigma,\xi)\,d\sigma\,d\rho = \int_D\!\int_D K(t,z,\rho)\,L_1(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho$$

where $L_1(t,\rho,\sigma)$ is given by Eq. 4.8.22. Using all of these results and Eq. 4.8.20 we see that $K(t,z,\xi)$ is the solution of the Riccati integro-differential equation

$$\frac{\partial K}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K(t,z,\xi) + \int_D\!\int_D K(t,z,\rho)\,L_1(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho - Q(t,z,\xi) \tag{4.8.24}$$

Moreover, it may be seen that the optimal cost is given by

$$J = \langle K_1(0)x_0,x_0\rangle_{H_0^m(D)} = \int_D \sum_{|q|\le m}\Big(D^q\!\int_D K_1(0,z,\xi)\,x_0(\xi)\,d\xi\Big)\big(D^q x_0(z)\big)\,dz = \int_D\!\int_D K(0,z,\xi)\,x_0(z)\,x_0(\xi)\,d\xi\,dz \tag{4.8.25}$$

Thus, we have shown that, corresponding to a state-weighting operator $Q(t)$ which is a bounded linear operator from $L^2(D)$ into $L^2(D)$, there exists a bounded linear feedback operator $K(t)$ from $L^2(D)$ into $L^2(D)$, and the kernel $K(t,z,\xi)$ of the integral representation of this operator satisfies the Riccati integro-differential equation (4.8.24).

Let us now consider the case of infinite terminal time. If we assume that $x_d(t)=0$, $B(t)=B$, $Q(t)=Q$, and $R(t)=R$, then the time-invariant optimal feedback operator $K_\infty$ is the solution of the algebraic Riccati operator equation (4.7.4). Using the procedures of this section, we find that the operator $K_\infty$ has an integral representation with kernel $K_\infty(z,\xi)$ which satisfies the equation

$$-(A_z^* + A_\xi^*)K_\infty(z,\xi) + \int_D\!\int_D K_\infty(z,\rho)\,L(\rho,\sigma)\,K_\infty(\sigma,\xi)\,d\sigma\,d\rho - Q(z,\xi) = 0 \tag{4.8.26}$$

where $L(\rho,\sigma)$ is the kernel of the integral representation of the time-invariant operator $L = BR^{-1}B^*$.

To illustrate Eq. 4.8.26, let us, once again, consider the heat equation example given at the end of Section 4.6,

$$\frac{\partial x(t,z)}{\partial t} = \frac{\partial^2 x(t,z)}{\partial z^2} + u(t,z)\ ;\quad x(0,z) = x_0(z)$$

with the boundary conditions

$$x(t,0) = x(t,1) = 0$$

Let us choose the cost functional

$$J = \int_0^\infty\Big[\int_0^1\!\!\int_0^1 \Big(\frac{4\pi^2+1}{2}\Big)\sin\pi z\,\sin\pi\xi\;x(t,z)\,x(t,\xi)\,dz\,d\xi + \int_0^1 u^2(t,z)\,dz\Big]dt$$

which corresponds to choosing the kernel of $Q$ to be $\big(\tfrac{4\pi^2+1}{2}\big)\sin\pi z\,\sin\pi\xi$, and the kernel of $R$ to be the Dirac delta function $\delta(z-\xi)$. The optimal feedback kernel $K_\infty(z,\xi)$ must satisfy Eq. 4.8.26, which, for this example, becomes

$$-\frac{\partial^2}{\partial z^2}K_\infty(z,\xi) - \frac{\partial^2}{\partial \xi^2}K_\infty(z,\xi) + \int_0^1 K_\infty(z,\rho)\,K_\infty(\rho,\xi)\,d\rho = \Big(\frac{4\pi^2+1}{2}\Big)\sin\pi z\,\sin\pi\xi$$

with boundary conditions

$$K_\infty(0,\xi) = K_\infty(1,\xi) = K_\infty(z,0) = K_\infty(z,1) = 0$$

The solution, by inspection, is

$$K_\infty(z,\xi) = \sin\pi z\,\sin\pi\xi$$

so that the optimal control may be written

$$u^*(t,z) = -\int_0^1 K_\infty(z,\xi)\,x(t,\xi)\,d\xi = -\sin\pi z\int_0^1 \sin\pi\xi\;x(t,\xi)\,d\xi$$
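The solution found by inspection can be confirmed numerically. The following sketch is an illustration only: it assumes NumPy, a uniform grid on $[0,1]$, second-order central differences for the spatial derivatives, and the trapezoidal rule for the integral. It substitutes $K_\infty(z,\xi)=\sin\pi z\,\sin\pi\xi$ into the algebraic equation of the example and measures the residual.

```python
import numpy as np

n = 201
z = np.linspace(0.0, 1.0, n)
h = z[1] - z[0]
s = np.sin(np.pi * z)

K = np.outer(s, s)                       # candidate kernel K_inf(z, xi)
q = (4.0 * np.pi**2 + 1.0) / 2.0
Q = q * np.outer(s, s)                   # kernel of the state-weighting operator

# Trapezoidal weights for the integral over rho in [0, 1].
w = np.full(n, h)
w[0] = w[-1] = h / 2.0
KK = K @ (w[:, None] * K)                # int_0^1 K(z, rho) K(rho, xi) d rho

# Central second differences in z and in xi at interior grid points.
Kzz = (K[2:, 1:-1] - 2.0 * K[1:-1, 1:-1] + K[:-2, 1:-1]) / h**2
Kxx = (K[1:-1, 2:] - 2.0 * K[1:-1, 1:-1] + K[1:-1, :-2]) / h**2

# Residual of  -K_zz - K_xixi + int K K d rho - Q = 0  at interior points.
residual = -Kzz - Kxx + KK[1:-1, 1:-1] - Q[1:-1, 1:-1]
max_residual = np.max(np.abs(residual))
```

The residual that remains is the finite-difference truncation error, which shrinks as the grid is refined; the boundary rows and columns of `K` vanish to rounding accuracy, in agreement with the Dirichlet conditions.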



CHAPTER V 


OPTIMAL POINTWISE FEEDBACK CONTROL 
5.1 INTRODUCTION

In this chapter we shall specialize the results obtained in Chapter IV for the parabolic optimal control problem to the pointwise optimal control problem defined in Chapter III. Section 5.2 is concerned with the actual derivation of the optimal pointwise feedback control. It will be shown that the optimal pointwise control is of a form which is, in a sense, computationally simpler than the general feedback form of the optimal control derived in Section 4.8. In Section 5.3 it will be shown that a particular choice of the state-weighting operator results in the traditional modal analytic solution. Still another choice of the state-weighting operator will be shown to result in a feedback solution of the pointwise optimal control problem under the condition that only a finite number of specific measurements, rather than the entire state, are available.

5.2 DERIVATION OF THE OPTIMAL POINTWISE FEEDBACK CONTROL

In this section the pointwise control problem is solved by placing the problem within the format of Section 4.8, that is, by introducing the feedback integral operator and writing the Riccati integro-differential equation. Since the pointwise control problem is characterized, mathematically, by the control space $U=R^k$ and the pointwise control operator $B_0(t)$ defined in Section 3.4, we know, from the results of Chapter IV, that an optimal control of the form $u^*(t) = -R^{-1}(t)B_0^*(t)K(t)x(t)$ exists,* where $K(t)$ satisfies the Riccati operator equation (4.4.11) with $B(t)=B_0(t)$. We also know, from the results of Section 4.8, that the Riccati operator equation may be represented by the integro-differential equation (4.8.10), namely

$$\frac{\partial K}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K(t,z,\xi) + \int_D\!\int_D K(t,z,\rho)\,L_0(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho - Q(t,z,\xi)$$

with

$$K(T,z,\xi) = F(z,\xi)$$

where $L_0(t,\rho,\sigma)$ is the kernel of the operator

$$L_0(t) = B_0(t)R^{-1}(t)B_0^*(t)$$

Attention will be focused on the nonlinear term of the Riccati equation, in which the kernel of the operator $L_0(t)$ appears. It will be shown that in this case a simplified form of the Riccati integro-differential equation holds, the solution of which leads, in an approximate sense, to a simplified form of the optimal control. The infinite time problem will also be discussed.

Recall that the pointwise control operator $B_0(t): R^k \to L^2(D)$ is of the form

$$B_0(t)u(t) = \sum_{i=1}^k \chi_i(z)\,b_i(t)\,u_i(t), \quad \forall\,u(t)\in R^k,\ \forall\,t\in[0,T]$$

where $\chi_i(z)$ is, again, the characteristic function of the set $E_i\subset D$ as specified in Section 3.4. The adjoint pointwise control operator $B_0^*(t): L^2(D)\to R^k$ may be determined in the following manner. If $y(\cdot)\in L^2(D)$ and $u\in R^k$, then

$$\langle B_0^*(t)y(\cdot),u\rangle_{R^k} = \langle y(\cdot),B_0(t)u\rangle_{L^2(D)} = \int_D y(z)\sum_{i=1}^k \chi_i(z)\,b_i(t)\,u_i\,dz = \sum_{i=1}^k \int_D y(z)\,\chi_i(z)\,dz\ b_i(t)\,u_i$$

From this we identify $B_0^*(t)y(\cdot)$ as the vector in $R^k$ whose $i$th component is

$$\big[B_0^*(t)y(\cdot)\big]_i = b_i(t)\int_D \chi_i(z)\,y(z)\,dz\ ,\quad i=1,2,\ldots,k$$

* We shall, for simplicity, consider the case where $x_d(t)=0$.
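The adjoint relation can be checked on a grid. This is a minimal sketch under assumptions that are not in the source: $D=[0,1]$, two control sets $E_1$ and $E_2$ taken as small intervals, constant coefficients $b_i$, and a midpoint quadrature rule. The function names `B0` and `B0_adj` are hypothetical.

```python
import numpy as np

n = 400
h = 1.0 / n
z = (np.arange(n) + 0.5) * h                 # midpoint grid on D = [0, 1]

# Characteristic functions of two assumed control sets E_1 and E_2.
chi = np.stack([np.abs(z - 0.2) < 0.05,
                np.abs(z - 0.6) < 0.10]).astype(float)   # shape (k, n)
b = np.array([1.5, -0.7])                    # assumed control coefficients b_i

def B0(u):
    """Pointwise control operator: (B0 u)(z) = sum_i chi_i(z) b_i u_i."""
    return (chi.T * b) @ u

def B0_adj(y):
    """Adjoint: i-th component is b_i * integral of chi_i(z) y(z) over D."""
    return b * (h * (chi @ y))

rng = np.random.default_rng(2)
u = rng.standard_normal(2)
y = rng.standard_normal(n)

lhs = B0_adj(y) @ u            # <B0* y, u> in R^k
rhs = h * (y @ B0(u))          # <y, B0 u> in L^2(D), midpoint rule
```

Both inner products agree to rounding error, as the derivation requires.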


In order to obtain an equation for the feedback kernel in the form of the Riccati integro-differential equation (4.8.10) we must express the operator $L_0(t) = B_0(t)R^{-1}(t)B_0^*(t)$ as an integral operator with kernel $L_0(t,\rho,\sigma)$. Using the dummy variable $\sigma$ with the $B_0^*(t)$ operator and the dummy variable $\rho$ with the $B_0(t)$ operator, and writing $R^{-1}_{ij}(t)$ for the $ij$th element of $R^{-1}(t)$, we obtain

$$(L_0(t)y)(\rho) = B_0(t)R^{-1}(t)\Big[b_j(t)\int_D \chi_j(\sigma)\,y(\sigma)\,d\sigma\Big]$$

$$= B_0(t)\Big[\sum_{j=1}^k R^{-1}_{ij}(t)\,b_j(t)\int_D \chi_j(\sigma)\,y(\sigma)\,d\sigma\Big]$$

$$= \sum_{i=1}^k\sum_{j=1}^k \chi_i(\rho)\,b_i(t)\,R^{-1}_{ij}(t)\,b_j(t)\int_D \chi_j(\sigma)\,y(\sigma)\,d\sigma$$

$$= \int_D \sum_{i=1}^k\sum_{j=1}^k \chi_i(\rho)\,b_i(t)\,R^{-1}_{ij}(t)\,b_j(t)\,\chi_j(\sigma)\,y(\sigma)\,d\sigma$$

so that the kernel $L_0(t,\rho,\sigma)$ is given by

$$L_0(t,\rho,\sigma) = \sum_{i=1}^k\sum_{j=1}^k \chi_i(\rho)\,b_i(t)\,R^{-1}_{ij}(t)\,b_j(t)\,\chi_j(\sigma)$$

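The nonlinear term in Eq. 4.8.10 may now be written as

$$\int_D\!\int_D K(t,z,\rho)\,L_0(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho$$

$$= \int_D\!\int_D K(t,z,\rho)\sum_{i=1}^k\sum_{j=1}^k \chi_i(\rho)\,b_i(t)\,R^{-1}_{ij}(t)\,b_j(t)\,\chi_j(\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho$$

$$= \sum_{i=1}^k\sum_{j=1}^k \Big(b_i(t)\int_D \chi_i(\rho)\,K(t,z,\rho)\,d\rho\Big)R^{-1}_{ij}(t)\Big(b_j(t)\int_D \chi_j(\sigma)\,K(t,\sigma,\xi)\,d\sigma\Big)$$

Let us define the vector function $k(t,z)$ to be the $k$-vector with $i$th component

$$k_i(t,z) = b_i(t)\int_D \chi_i(\rho)\,K(t,z,\rho)\,d\rho\ ;\quad t\in[0,T],\ z\in D$$

Using this vector function and the symmetry of the kernel, we may rewrite Eq. 5.2.2 as

$$\int_D\!\int_D K(t,z,\rho)\,L_0(t,\rho,\sigma)\,K(t,\sigma,\xi)\,d\sigma\,d\rho = k'(t,z)\,R^{-1}(t)\,k(t,\xi)$$

and the Riccati integro-differential equation (4.8.10) for the pointwise control problem becomes:

$$\frac{\partial K}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K(t,z,\xi) + k'(t,z)\,R^{-1}(t)\,k(t,\xi) - Q(t,z,\xi)$$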
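The reduction of the double integral to the quadratic form $k'R^{-1}k$ can be illustrated on a grid. The sketch below rests on assumptions made only for this illustration: $D=[0,1]$, a midpoint quadrature rule, an arbitrary symmetric test kernel `K` (not a solution of the Riccati equation), two interval-shaped control sets, and a fixed positive definite `R`.

```python
import numpy as np

n = 200
h = 1.0 / n
z = (np.arange(n) + 0.5) * h                      # midpoint grid on D = [0, 1]

# Assumed symmetric test kernel K(z, xi).
s1, s2 = np.sin(np.pi * z), np.sin(2.0 * np.pi * z)
K = np.outer(s1, s1) + 0.3 * np.outer(s2, s2)

# Assumed control sets, coefficients b_i, and control weighting matrix R.
chi = np.stack([np.abs(z - 0.3) < 0.05,
                np.abs(z - 0.7) < 0.05]).astype(float)   # shape (k, n)
b = np.array([2.0, -1.5])
R = np.array([[2.0, 0.5],
              [0.5, 1.0]])
Rinv = np.linalg.inv(R)

# Kernel L0(rho, sigma) = sum_ij chi_i(rho) b_i Rinv_ij b_j chi_j(sigma).
cb = chi.T * b                                    # shape (n, k)
L0 = cb @ Rinv @ cb.T

# Double integral of K(z, rho) L0(rho, sigma) K(sigma, xi) by the midpoint rule.
lhs = (h * K) @ L0 @ (h * K)

# Vector function k(z): i-th component b_i * integral of chi_i(rho) K(z, rho).
kvec = h * (K @ cb)                               # shape (n, k)
rhs = kvec @ Rinv @ kvec.T                        # k'(z) Rinv k(xi)
```

The two arrays agree to rounding error. Only the $k$ columns of `kvec` are needed to form the nonlinear term, rather than the full kernel $L_0$.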


The expression for the optimal pointwise control is

$$u^*(t) = -R^{-1}(t)B_0^*(t)K(t)x(t) = -R^{-1}(t)B_0^*(t)\int_D K(t,z,\xi)\,x(t,\xi)\,d\xi$$

$$= -R^{-1}(t)\Big[b_i(t)\int_D \chi_i(z)\int_D K(t,z,\xi)\,x(t,\xi)\,d\xi\,dz\Big]$$

$$= -R^{-1}(t)\int_D\Big[b_i(t)\int_D \chi_i(z)\,K(t,z,\xi)\,dz\Big]x(t,\xi)\,d\xi$$

$$= -R^{-1}(t)\int_D k(t,\xi)\,x(t,\xi)\,d\xi$$

where each bracket denotes the $k$-vector whose $i$th component is shown.


We shall now introduce an approximation by the use of the assumption in Section 3.4 that control action takes place over "volumes" $E_i\subset D$ which are very small compared to $D$ itself. In other words it may be assumed that each of the sets $\{E_i\}_{i=1}^k$ containing the points $\{z_i\}_{i=1}^k$ has measure* $\mu(E_i) < \epsilon$, where $\epsilon$ is very small compared to $\mu(D)$. Let us also assume that the control operator coefficients $b_i(t)$, $i=1,\ldots,k$ are of the order of $1/\epsilon$, that is, let us assume that

$$b_i(t) = \frac{1}{\epsilon}\,\beta_i(t)$$

The physical motivation for this assumption lies in the fact that unless the control coefficients were of the order of $1/\epsilon$, any finite amount of control would enter the system with magnitude of order $\epsilon$, and, under the assumption that $\epsilon$ is very small, would have no effect on the system. If the control coefficients $b_i(t)$ are of order $1/\epsilon$, then we shall see that finite control energy results in a forcing term of the same order of magnitude in the system equation.

* $\mu$ is the Lebesgue measure in $R^n$.

If $K(t,z,\xi)$ is sufficiently smooth and $\epsilon$ is chosen small enough, then we have the approximation

$$b_i(t)\int_D \chi_i(\rho)\,K(t,z,\rho)\,d\rho \simeq \beta_i(t)\,K(t,z,z_i)$$

which holds for $i=1,2,\ldots,k$. What, in effect, has been done here is to assume that the control coefficients are approximately impulsive in the spatial variable. Note that this assumption was invalid in the rigorous proofs of existence of optimal controls and existence of solutions of the Riccati operator equation. However, at this juncture, the assumption is valid because we are simply trying to solve approximately an equation which we already know has a solution.
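The quality of this approximation as $\epsilon$ shrinks can be illustrated with a kernel for which the integral is available in closed form. The sketch below is an assumption-laden example, not part of the source derivation: it takes $K(z,\rho)=\sin\pi z\,\sin\pi\rho$ on $D=[0,1]$, a control set $E_i=[z_i-\epsilon/2,\ z_i+\epsilon/2]$ of measure exactly $\epsilon$, and a constant $\beta_i$. The function names are hypothetical.

```python
import numpy as np

def averaged_gain(z, z_i, eps, beta_i):
    """b_i * integral over E_i of K(z, rho) d rho, with b_i = beta_i / eps.

    Uses the exact integral of sin(pi rho) over [z_i - eps/2, z_i + eps/2].
    """
    lo, hi = z_i - eps / 2.0, z_i + eps / 2.0
    integral = np.sin(np.pi * z) * (np.cos(np.pi * lo) - np.cos(np.pi * hi)) / np.pi
    return (beta_i / eps) * integral

def point_gain(z, z_i, beta_i):
    """Impulsive approximation beta_i * K(z, z_i)."""
    return beta_i * np.sin(np.pi * z) * np.sin(np.pi * z_i)

z = np.linspace(0.0, 1.0, 101)
z_i, beta_i = 0.3, 1.0

# Maximum deviation over z between the averaged gain and the point evaluation.
errors = {
    eps: float(np.max(np.abs(averaged_gain(z, z_i, eps, beta_i)
                             - point_gain(z, z_i, beta_i))))
    for eps in (0.1, 0.01, 0.001)
}
```

For this smooth kernel the deviation falls roughly in proportion to $\epsilon^2$, which is consistent with treating the control coefficients as impulsive when $\epsilon$ is small.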

As a consequence of the above approximation the vector function $k(t,z)$ can be approximated by

$$k(t,z) \simeq \hat{k}(t,z)$$

where $\hat{k}(t,z)$ is the $k$-vector with $i$th component

$$\hat{k}_i(t,z) = \beta_i(t)\,K(t,z,z_i) \tag{5.2.5}$$

With this approximation the Riccati integro-differential equation becomes

$$\frac{\partial K}{\partial t}(t,z,\xi) = -(A_z^* + A_\xi^*)K(t,z,\xi) + \hat{k}'(t,z)\,R^{-1}(t)\,\hat{k}(t,\xi) - Q(t,z,\xi) \tag{5.2.6}$$

What is more interesting is that if the above approximation is used in Eq. 5.2.4 the expression for the optimal control becomes

$$u^*(t) = -R^{-1}(t)\int_D \hat{k}(t,\xi)\,x(t,\xi)\,d\xi = -R^{-1}(t)\Big[\int_D \beta_i(t)\,K(t,\xi,z_i)\,x(t,\xi)\,d\xi\Big]$$

which implies that it is necessary to determine the $k$ functions $K(t,z,z_i)$, $i=1,2,\ldots,k$ in order to completely specify the optimal feedback control. If one were to consider the computational requirements, then the computation of these $k$ functions would be simpler than the computation of the entire feedback kernel, that is, the computation of $K(t,z,\xi)$ for all values of both spatial arguments in $D\times D$.

To summarize the above results, let us examine the structure of the feedback control system. The state distribution $x(t,z)$ is fed back through $k$ devices which take a weighted spatial average of the state distribution. The weighting function in the $i$th averaging device is $K(t,z,z_i)$, $i=1,2,\ldots,k$, and the output is a function of time which may be denoted $y_i(t)$, $i=1,2,\ldots,k$. The $k$-vector $y(t)$, with $i$th component $y_i(t)$, is then transformed to the optimal control

$$u^*(t) = -R^{-1}(t)\,\mathbf{B}(t)\,y(t)$$

where $\mathbf{B}(t)$ is the diagonal matrix with $i$th diagonal element $B_{ii}(t) = \beta_i(t)$, the $i$th control coefficient. Let us illustrate the system thus obtained by means of a block diagram in which distinct arrow styles indicate the flow of a scalar quantity, a $k$-vector, and a distributed quantity. The optimal closed-loop system is represented in Fig. 1. It is interesting to note from this feedback structure that if we were able to measure $y(t)$ directly, that is, if we had $k$ measuring devices which average the state distribution with weighting functions $K(t,z,z_i)$, $i=1,2,\ldots,k$, then we would feed back the measurements, rather than the entire state distribution, in order to construct the optimal feedback control. This leads to a question which is somewhat analogous to the inverse problem of finite dimensional control theory, namely, if we have $k$ measuring devices of the form

$$y_i(t) = \int_D m_i(t,z)\,x(t,z)\,dz\ ;\quad i=1,2,\ldots,k$$

where $\{m_i(t,z)\}_{i=1}^k$ is an arbitrary set of weighting functions, then does there exist a state-weighting kernel $Q_1(t,z,\xi)$ such that the solution $K(t,z,\xi)$ of Eq. 5.2.6 with $Q(t,z,\xi) = Q_1(t,z,\xi)$ satisfies the property

$$K(t,z,z_i) = m_i(t,z)$$

for all $t\in[0,T]$, for all $z\in D$, and $i=1,2,\ldots,k$? Looked upon another way, obtaining a solution for the set of functions $\{K(t,z,z_i)\}_{i=1}^k$ enables one to design appropriate instruments with weighting functions equal to $K(t,z,z_i)$. This measurement question will be treated in Section 5.3, where a particular class of measurement weighting functions will be considered.



Fig. 1 Optimal Closed-Loop Pointwise Control System 

Let us consider the pointwise control problem on the infinite
time interval. If the system operator A is coercive then, according
to Section 4.7, an optimal control exists on the infinite time interval.
Moreover, if the system is time invariant, the feedback kernel is
$K(t,z,\zeta) = K(z,\zeta)$ and $\hat{k}(t,z)$ defined in (5.2.5) becomes

$$\hat{k}(z) = \begin{bmatrix} \beta_1 K(z,z_1) \\ \vdots \\ \beta_k K(z,z_k) \end{bmatrix}$$

so that we obtain the following time-invariant Riccati equation:

$$-(A_z^* + A_\zeta^*)K(z,\zeta) + \hat{k}'(z)R^{-1}\hat{k}(\zeta) - Q(z,\zeta) = 0 \tag{5.2.7}$$

Under the assumption that the system operator A is the
infinitesimal generator of a semigroup of operators, we know, from
Section 4.7, that complete controllability is sufficient in order to
guarantee the existence of an infinite time solution. In the case of
pointwise control, the condition for complete controllability, namely,
the invertibility of $L_{t_1}L_{t_1}^*$ given in Eq. 4.7.6, becomes the
determination of a time $t_1$ such that the following operator is
invertible:

$$L_{t_1}L_{t_1}^* = \int_{t_0}^{t_1} \Phi(t_1-\sigma)B_0(\sigma)B_0^*(\sigma)\Phi^*(t_1-\sigma)\,d\sigma$$

which, for any $x \in H_0^m(D)$, reduces to

$$L_{t_1}L_{t_1}^*x = \int_{t_0}^{t_1} \Phi(t_1-\sigma)B_0(\sigma)\begin{bmatrix} \beta_1(\sigma)\int_D \chi_1(\zeta)(\Phi^*(t_1-\sigma)x)(\zeta)\,d\zeta \\ \vdots \\ \beta_k(\sigma)\int_D \chi_k(\zeta)(\Phi^*(t_1-\sigma)x)(\zeta)\,d\zeta \end{bmatrix} d\sigma = \int_{t_0}^{t_1} \Phi(t_1-\sigma)\,y(\sigma)\,d\sigma \tag{5.2.8}$$

where $(\Phi^*(t_1-\sigma)x)(\zeta)$ is the evaluation of
$\Phi^*(t_1-\sigma)x \in H_0^m(D)$ at the point $\zeta \in D$, and
where $y(\sigma) \in H_0^m(D)$ is given by

$$y(\sigma,z) = \sum_{i=1}^k \chi_i(z)\,\beta_i^2(\sigma)\int_D \chi_i(\zeta)(\Phi^*(t_1-\sigma)x)(\zeta)\,d\zeta$$

Note that $y(\sigma,z)$ is zero everywhere except on the sets $E_i$, on which
$y(\sigma,z)$ has the values

$$y(\sigma,z) = \beta_i^2(\sigma)\int_D \chi_i(\zeta)(\Phi^*(t_1-\sigma)x)(\zeta)\,d\zeta\,; \qquad z \in E_i$$

The question of invertibility of (5.2.8) for some time $t_1$ is still open.
It can be seen that a necessary condition for invertibility of expression
(5.2.8) is that there exists a subset of $(0,\infty)$ with positive measure
upon which the operator $\Phi(t)$ transforms the subspace of functions
with support on $\bigcup_{i=1}^k E_i$ onto all of $H_0^m(D)$. Otherwise, there
would be no chance for the operator given by (5.2.8) to have its range
space equal to $H_0^m(D)$ for some time $t_1$, which is equivalent to
invertibility. Since controllability is still a matter of open research,
we shall assume it, where necessary, in the sequel.

In any case, if controllability is assumed, then infinite time
solutions exist in the case where the system operator is the
infinitesimal generator of a semigroup of operators, and the
time-invariant Riccati integro-differential equation (5.2.7) holds in
this case as well. The optimal control, in this case, is given by the
time-invariant linear feedback control law

$$u^*(t) = -R^{-1}\int_D \hat{k}(z)\,x(t,z)\,dz \tag{5.2.9}$$

and the optimal cost function is

$$\int_D\int_D K(z,\zeta)\,x(t,z)\,x(t,\zeta)\,d\zeta\,dz$$
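As an example of a pointwise control problem let us, once
again, consider the scalar heat equation

$$\frac{\partial x(t,z)}{\partial t} = \frac{\partial^2 x(t,z)}{\partial z^2} + B_0u(t)\,; \qquad x(0,z) = x_0(z)$$

with boundary conditions

$$x(t,0) = x(t,1) = 0$$

and pointwise control operator

$$B_0u(t) = \sum_{i=1}^k \chi_i(z)\,u_i(t)$$

If we choose a cost criterion of the form

$$J = \int_0^\infty \left[ \int_0^1\int_0^1 \Big(2\pi^2 + \sum_{i=1}^k \sin^2\pi z_i\Big) \sin\pi z\,\sin\pi\zeta\; x(t,z)\,x(t,\zeta)\,d\zeta\,dz + u'(t)u(t) \right] dt$$

where $\{z_i\}_{i=1}^k$ is the set of control points, the Riccati
integro-differential equation (5.2.7) becomes

$$-\frac{\partial^2}{\partial z^2}K(z,\zeta) - \frac{\partial^2}{\partial \zeta^2}K(z,\zeta) + \hat{k}'(z)\hat{k}(\zeta) = \Big(2\pi^2 + \sum_{i=1}^k \sin^2\pi z_i\Big)\sin\pi z\,\sin\pi\zeta$$

for which

$$K(z,\zeta) = \sin\pi z\,\sin\pi\zeta$$

is a solution satisfying the boundary conditions

$$K(0,\zeta) = K(1,\zeta) = K(z,0) = K(z,1) = 0$$

Since $\hat{k}(z)$ is the k-vector with i-th component

$$\hat{k}_i(z) = \sin\pi z\,\sin\pi z_i$$

the optimal control, from Eq. 5.2.9, is the k-vector with i-th component

$$u_i^*(t) = -\sin\pi z_i \int_0^1 \sin\pi z\; x(t,z)\,dz$$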
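
The claim that $K(z,\zeta) = \sin\pi z\,\sin\pi\zeta$ satisfies the time-invariant
Riccati equation of this example can be spot-checked numerically. The
following is a minimal sketch, assuming unit control coefficients and
$R = 1$ as in the cost criterion above; the control points, the step size
`h`, and the function names are hypothetical choices made only for
illustration.

```python
import math

def K(z, zeta):
    """Candidate feedback kernel K(z, zeta) = sin(pi z) sin(pi zeta)."""
    return math.sin(math.pi * z) * math.sin(math.pi * zeta)

def riccati_residual(z, zeta, control_points, h=1e-4):
    """Residual of -K_zz - K_zetazeta + k'(z) k(zeta) - Q(z, zeta)."""
    # Central second differences approximate the two partial derivatives.
    k_zz = (K(z + h, zeta) - 2.0 * K(z, zeta) + K(z - h, zeta)) / h**2
    k_cc = (K(z, zeta + h) - 2.0 * K(z, zeta) + K(z, zeta - h)) / h**2
    # Pointwise gain vector: k_i(z) = K(z, z_i) with unit coefficients.
    quad = sum(K(z, zi) * K(zeta, zi) for zi in control_points)
    # State-weighting kernel taken from the cost criterion of the example.
    weight = 2.0 * math.pi**2 + sum(math.sin(math.pi * zi) ** 2
                                    for zi in control_points)
    q = weight * math.sin(math.pi * z) * math.sin(math.pi * zeta)
    return -k_zz - k_cc + quad - q

points = [0.3, 0.8]  # hypothetical control points in (0, 1)
worst = max(abs(riccati_residual(i / 10, j / 10, points))
            for i in range(1, 10) for j in range(1, 10))
print(worst)  # small; limited only by the finite-difference error
```

The residual is not exactly zero because the derivatives are replaced
by finite differences; an analytic check gives zero identically.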
We shall consider a special class of solutions of Eq. 5.2.7 in
the next section which enable us to compare our results with those
obtained by using the modal analytic approach.

5.3 THE INFINITE TIME PROBLEM AND MODAL ANALYTIC SOLUTIONS

In order to obtain a better physical understanding of the nature 
of the optimal pointwise feedback control obtained in the preceding 
section, we shall relate these results to the results obtained through 
the application of the techniques of modal analysis. We shall show 
that a particular choice of the form of the kernel Q(z, £,) in Eq. 5.2.7 
results, under certain conditions, in the transformation of the integro- 
differential equation (5.2.7) into an algebraic matrix equation. It 
will be shown that this finite dimensional Riccati equation is associated 
with the finite modal approximation of the optimal control problem 
under consideration. Placement of the control points will be shown to 
have a direct effect on the existence of an optimal modal solution. The 
optimal solution for an illustrative example will be studied. 

For convenience let us rewrite the Riccati integro-differential
equation for the time-invariant feedback kernel associated with the
infinite terminal-time pointwise control problem:

$$-(A_z^* + A_\zeta^*)K(z,\zeta) + \hat{k}'(z)R^{-1}\hat{k}(\zeta) - Q(z,\zeta) = 0 \tag{5.3.1}$$

where $\hat{k}(z)$ is the k-vector whose i-th component is

$$\hat{k}_i(z) = \beta_i K(z,z_i)$$

the set $\{z_i\}_{i=1}^k$, once again, being the control points in D.

As a preliminary to showing that the optimal modal solution can
be deduced from Eq. 5.3.1, let us consider the case where no control
is applied to the system. We can show the cost of starting at
$x \in H_0^m(D)$ at time t to be

$$\int_D\int_D x(z)K(z,\zeta)x(\zeta)\,d\zeta\,dz = \int_t^\infty\int_D\int_D x(\sigma,z)\,Q(z,\zeta)\,x(\sigma,\zeta)\,d\zeta\,dz\,d\sigma \tag{5.3.2}$$

where $x(\sigma,z)$ is the evaluation at the point $z \in D$ of the element
$x(\sigma) \in H_0^m(D)$ which satisfies

$$\dot{x}(\sigma) = Ax(\sigma)\,; \qquad x(t) = x$$

and where $K(z,\zeta)$ is the solution of the linear equation

$$-(A_z^* + A_\zeta^*)K(z,\zeta) - Q(z,\zeta) = 0 \tag{5.3.3}$$

Let us suppose that the system operator A has a countable spectrum.
The eigenfunctions $\{v_i(z)\}_{i=1}^\infty$ of the adjoint operator $A^*$
satisfy the equation

$$A_z^* v_i(z) = \lambda_i v_i(z)$$

for $i=1,2,\ldots$. If we choose the kernel of the state-weighting
operator to be

$$Q(z,\zeta) = v'(z)\,Q\,v(\zeta) \tag{5.3.4}$$

where Q is an n×n positive definite constant matrix and $v(z)$ is
the n-vector whose i-th component is the eigenfunction $v_i(z)$, the state
weighting operator will still satisfy the requirement of positivity,
since

$$\langle Qx,x\rangle_{L^2(D)} = \int_D\int_D x(z)\,v'(z)\,Q\,v(\zeta)\,x(\zeta)\,d\zeta\,dz = \left[\int_D x(z)v'(z)\,dz\right] Q \left[\int_D v(\zeta)x(\zeta)\,d\zeta\right] = \underline{x}'Q\,\underline{x} \ge 0$$

where $\underline{x}$ is the n-vector whose i-th component is
$\int_D x(z)v_i(z)\,dz$. Note that the operator Q is not strictly positive
since there exist nonzero vectors $x \in H_0^m(D)$ which are orthogonal to
the subspace generated by the first n eigenfunctions, resulting in
$\langle Qx,x\rangle_{L^2(D)} = 0$. Note that if we allow n to approach
infinity the kernel $Q_\infty(z,\zeta)$ of a positive operator is obtained.
The precise nature of this limiting procedure will be discussed when the
concepts of modal approximation are treated later on in this section.
We may now state the following theorem:

Theorem 5.1: If $Q(z,\zeta)$ is given by Eq. 5.3.4, then the optimal
feedback* kernel for the zero-control case is given by

$$K(z,\zeta) = v'(z)\,K\,v(\zeta) \tag{5.3.5}$$

where K is the n×n positive definite solution matrix of the matrix
equation

$$\Lambda K + K\Lambda = -Q \tag{5.3.6}$$

with $\Lambda$ defined to be the diagonal n×n matrix with i-th diagonal
element $\Lambda_{ii} = \lambda_i$.
ll l 

Proof: Using Eq. 5.3.5 in Eq. 5.3.3 and using the linearity of
$A^*$ we obtain

$$A_z^*v'(z)\,K\,v(\zeta) + v'(z)\,K\,A_\zeta^*v(\zeta) = -v'(z)\,Q\,v(\zeta)$$

Since the elements of $v(z)$ are eigenfunctions of $A^*$, this
equation becomes

$$v'(z)\Lambda K v(\zeta) + v'(z)K\Lambda v(\zeta) = -v'(z)\,Q\,v(\zeta)$$

If a solution of this equation is to exist for all $z, \zeta \in D$, then
it must be true that the matrix K satisfies Eq. 5.3.6.
Moreover, since $K(z,\zeta)$ must be the kernel of a positive
operator on $H_0^m(D)$, the matrix K must be positive definite.
It is a well-known fact that if the matrix Q is positive definite
and the matrix $\Lambda$ has all its eigenvalues in the left
half-plane, then a positive definite solution of Eq. 5.3.6 exists.
The matrix Q is positive definite by assumption and, since
the spectra of both coercive and strongly elliptic system
operators lie in the left half-plane, the eigenvalues of $\Lambda$
lie in the left half-plane, so that a positive definite matrix
solution K to Eq. 5.3.6 exists. Thus, $K(z,\zeta)$, given by
Eq. 5.3.5, is the kernel of a positive operator which is the
solution of Eq. 5.3.3. By the uniqueness† of positive
solutions of Eq. 5.3.3 this kernel is optimal.

* The term "feedback" is used loosely here, since we are applying
no control.

We can conclude from this theorem that in the zero-control case
the cost function depends only on the first n mode coefficients of the
initial state $x \in H_0^m(D)$. This can be shown by evaluating the cost
function

$$J = \langle Kx,x\rangle_{L^2(D)} = \int_D\int_D x(z)K(z,\zeta)x(\zeta)\,d\zeta\,dz = \left[\int_D x(z)v'(z)\,dz\right] K \left[\int_D x(\zeta)v(\zeta)\,d\zeta\right] = \underline{x}'K\underline{x}$$

where $\underline{x}$ is the n-vector whose i-th component is the i-th mode
coefficient $\underline{x}_i = \int_D x(z)v_i(z)\,dz$.
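
Because the matrix of eigenvalues in Eq. 5.3.6 is diagonal, the matrix
equation can be solved entry by entry: the ij-th entry of the solution
is $-Q_{ij}/(\lambda_i + \lambda_j)$. The sketch below illustrates this
and the resulting zero-control cost $\underline{x}'K\underline{x}$. The
eigenvalues $\lambda_i = -i^2\pi^2$ are borrowed from the heat equation
as an illustrative assumption, and the weighting matrix, the modal
coefficients, and the function names are hypothetical.

```python
import math

def solve_diagonal_lyapunov(eigenvalues, Q):
    """Solve Lambda K + K Lambda = -Q entrywise for diagonal Lambda."""
    n = len(eigenvalues)
    return [[-Q[i][j] / (eigenvalues[i] + eigenvalues[j]) for j in range(n)]
            for i in range(n)]

def zero_control_cost(K, modal_coeffs):
    """Quadratic form x' K x in the first n modal coefficients."""
    n = len(modal_coeffs)
    return sum(modal_coeffs[i] * K[i][j] * modal_coeffs[j]
               for i in range(n) for j in range(n))

# Assumed data: two heat-equation modes and a positive definite weighting.
lam = [-(i * math.pi) ** 2 for i in (1, 2)]
Q = [[2.0, 0.5], [0.5, 1.0]]
K = solve_diagonal_lyapunov(lam, Q)

# Residual of Lambda K + K Lambda + Q, which should vanish.
residual = max(abs(lam[i] * K[i][j] + K[i][j] * lam[j] + Q[i][j])
               for i in range(2) for j in range(2))
print(residual, zero_control_cost(K, [1.0, -0.5]))
```

Since every $\lambda_i$ is negative, each denominator is nonzero and the
resulting matrix is positive definite whenever Q is.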

A natural question to ask at this juncture would be: does a
solution of the form (5.3.5) exist for the system with pointwise
control when the kernel $Q(z,\zeta)$ is again given by expression (5.3.4)?
We shall show that under certain circumstances such a solution exists for
the optimal feedback kernel and that the solution is directly related to
the finite modal approximation of the original system.

† Uniqueness follows from the uniqueness of the limit in Theorem 4.11.

If the control is a k-vector, with k not necessarily equal to n
(the dimension of the vector $v(z)$), the substitution of Eq. 5.3.5 into
the pointwise Riccati integro-differential equation (5.3.1) yields the
equation

$$-(A_z^* + A_\zeta^*)\,v'(z)Kv(\zeta) + m'(z)R^{-1}m(\zeta) - v'(z)\,Q\,v(\zeta) = 0 \tag{5.3.7}$$

where $m(z)$ is the k-vector whose i-th component is

$$m_i(z) = \beta_i\,v'(z)Kv(z_i)\,; \qquad i=1,2,\ldots,k$$

We can write the vector $m(z)$ in the form

$$m(z) = BVKv(z) \tag{5.3.8}$$

where B is the diagonal k×k matrix whose i-th diagonal element is
$B_{ii} = \beta_i$ and V is the k×n matrix with ij-th element
$V_{ij} = v_j(z_i)$.

Using Eq. 5.3.8 in Eq. 5.3.7, we see that the Riccati
integro-differential equation (5.3.1) has a positive solution of the form
$K(z,\zeta) = v'(z)Kv(\zeta)$ if there exists a positive definite solution K
of the algebraic matrix Riccati equation

$$-\Lambda K - K\Lambda + KV'BR^{-1}BVK - Q = 0 \tag{5.3.9}$$

where $\Lambda$ is again the diagonal n×n matrix of eigenvalues. We know
that this Riccati equation is associated with the following
finite-dimensional optimization problem:

Given the n-dimensional system

$$\dot{\underline{x}}(t) = \Lambda\underline{x}(t) + V'Bu(t)\,; \qquad \underline{x}(0) = \underline{x}_0 \tag{5.3.10}$$

determine the control $u^*(t) \in R^k$ which minimizes the cost
functional

$$J = \int_0^\infty \left[\underline{x}'(t)\,Q\,\underline{x}(t) + u'(t)Ru(t)\right] dt \tag{5.3.11}$$

Thus, from our knowledge of the finite-dimensional state
regulator, we know that Eq. 5.3.9 has a positive definite solution K if
the system (5.3.10) is completely controllable.* We consider the
standard test for complete controllability in time-invariant, finite
dimensional, linear systems (see Athans and Falb,$^{24}$ p. 205), namely,
if G is the n×(nk) matrix defined by

$$G = \left[\, V'B \;\vdots\; \Lambda V'B \;\vdots\; \Lambda^2V'B \;\vdots\; \cdots \;\vdots\; \Lambda^{n-1}V'B \,\right] \tag{5.3.12}$$

then the system (5.3.10) is completely controllable if and only if

$$\operatorname{rank} G = n$$
The system given by Eq. 5.3.10 is interesting in another
respect. It is precisely the n-mode modal analytic approximation of the
original distributed parameter system given in Eq. 3.2.2 with $B = B_0$,
the pointwise control operator. This can be seen by considering the
modal decomposition of the forcing term $B_0u(t)$:

$$\int_D v(z)B_0u(t)\,dz = \int_D v(z)\sum_{i=1}^k \chi_i(z)\beta_iu_i(t)\,dz = \sum_{i=1}^k \beta_iu_i(t)\int_D \chi_i(z)v(z)\,dz = \sum_{i=1}^k \beta_iu_i(t)\,v(z_i) = V'Bu(t)$$

which is the forcing term in Eq. 5.3.10. The preceding is summarized as:

Theorem 5.2: If $Q(z,\zeta) = v'(z)\,Q\,v(\zeta)$, with Q positive definite,**
and if the rank of the matrix G, defined in Eq. 5.3.12, is n, then
there is a solution of the Riccati integro-differential equation (5.3.1)
which is the kernel of a positive operator and which has the form
$K(z,\zeta) = v'(z)Kv(\zeta)$, where the matrix K is the positive definite
solution of the matrix Riccati Eq. 5.3.9.

* Observability is actually sufficient for definiteness.

** Positive semi-definiteness is sufficient in this case.
Let us also note that the optimal pointwise feedback control,
from Eq. 5.2.4, is given by

$$u^*(t) = -R^{-1}\int_D \hat{k}(\zeta)x(t,\zeta)\,d\zeta = -R^{-1}BVK\int_D v(\zeta)x(t,\zeta)\,d\zeta = -R^{-1}BVK\,\underline{x}(t) \tag{5.3.13}$$

with

$$\underline{x}(t) = \int_D v(\zeta)\,x(t,\zeta)\,d\zeta$$

i.e., $\underline{x}(t)$ is the n-vector of modal coefficients of $x(t,z)$.
Moreover, the minimum cost of starting at time t with initial state
$x_0 \in H_0^m(D)$ is given by

$$J = \int_D\int_D x_0(z)K(z,\zeta)x_0(\zeta)\,d\zeta\,dz = \left[\int_D x_0(z)v'(z)\,dz\right] K \left[\int_D v(\zeta)x_0(\zeta)\,d\zeta\right] = \underline{x}_0'K\underline{x}_0$$

where $\underline{x}_0$ is the n-vector of modal coefficients of $x_0(z)$. Thus,
we have shown that by choosing $Q(z,\zeta)$ to be of the form specified in
Eq. 5.3.4, both the optimal control and the optimal cost function
depend only on the first n modal coefficients of the state variable.
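
The feedback law of Eq. 5.3.13 amounts to two steps: project the state
distribution onto the first n eigenfunctions, then multiply by a constant
gain. The following is a minimal sketch of those two steps, assuming the
eigenfunctions $v_i(z) = \sin i\pi z$ on (0, 1), a scalar $R^{-1}$, and a
midpoint quadrature rule; the matrix K, the control point, and the
function names are hypothetical and chosen only for illustration.

```python
import math

def modal_coefficients(state, n_modes, n_cells=2000):
    """Midpoint-rule approximation of the integrals of sin(i pi z) x(z)."""
    h = 1.0 / n_cells
    nodes = [(j + 0.5) * h for j in range(n_cells)]
    return [h * sum(math.sin(i * math.pi * z) * state(z) for z in nodes)
            for i in range(1, n_modes + 1)]

def modal_feedback(state, control_points, betas, K, r_inv=1.0):
    """u* = -R^{-1} B V K x_bar, with scalar R^{-1} = r_inv (assumed)."""
    n = len(K)
    x_bar = modal_coefficients(state, n)
    kx = [sum(K[i][j] * x_bar[j] for j in range(n)) for i in range(n)]
    # Row i of V holds the eigenfunctions evaluated at control point z_i.
    return [-r_inv * b * sum(math.sin((j + 1) * math.pi * z) * kx[j]
                             for j in range(n))
            for z, b in zip(control_points, betas)]

# Hypothetical data: one control at z = 0.5 and an assumed gain matrix K.
K = [[2.0, 0.0], [0.0, 1.0]]
u = modal_feedback(lambda z: math.sin(math.pi * z), [0.5], [1.0], K)
print(u)
```

Only the n integrals in `modal_coefficients` require access to the
distributed state; everything after that is finite dimensional.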

This has very interesting implications as far as the modal analytic
approach is concerned. In the modal analytic approach, a system of the
form (5.3.10) is obtained, and a finite-dimensional cost functional of
the form (5.3.11) is used. Naturally, the optimal control and optimal
cost function would only depend on the finite-dimensional state
variable (the modal coefficients). It is difficult to say, one way or the
other, via straightforward modal analytic techniques, whether feeding
back higher order modes would result in a smaller value of the cost
functional. Theorem 5.2 allows us to make a definitive statement,
namely: if the rank of the controllability matrix G is n then we can
never do any better by feeding back more than the first n modes. If
the rank of G is less than n, we know, from the results of Chapter IV
and Section 5.2, that a positive operator kernel solution of the Riccati
integro-differential equation (5.3.1) still exists, but it is not of the
form $K(z,\zeta) = v'(z)Kv(\zeta)$, or, in other words, the optimal control
and optimal cost function will depend on modes of order higher than n.

The above results allow us to make still another conclusion
concerning the modal analytic approximation. If we are trying to
approximate an arbitrary state-weighting kernel $Q(z,\zeta)$ by the n-th
order approximate kernel

$$Q_n(z,\zeta) = v_n'(z)\,Q_n\,v_n(\zeta) \tag{5.3.14}$$

then the positive operator $Q_n$ represented by this kernel is less than
the operator Q, represented by the kernel $Q(z,\zeta)$, in the sense of the
ordering relation introduced in Section 4.6. Moreover, increasing the
order of the modal approximation by one results in a more positive
state-weighting operator, that is, $Q_{n+1} \ge Q_n$, since, now, the
presence of the (n+1)-th mode increases the cost. We may now ask
whether this results in an increase in the resulting optimal cost, or,
more precisely, is it true that we have the relation

$$K_1 \le K_2 \le K_3 \le \cdots$$

where the operator $K_n$ is the positive, self-adjoint solution of the
Riccati operator equation

$$-A^*K_n - K_nA + K_nB_0R^{-1}B_0^*K_n - Q_n = 0 \tag{5.3.15}$$
where $B_0$ is the time-invariant pointwise control operator. If we
consider the operators $K_{n+1}$ and $K_n$ and the difference of the equations
of the form of Eq. 5.3.15 which they satisfy, we obtain

$$-A^*(K_{n+1}-K_n) - (K_{n+1}-K_n)A + K_{n+1}B_0R^{-1}B_0^*K_{n+1} - K_nB_0R^{-1}B_0^*K_n - (Q_{n+1}-Q_n) = 0$$

This equation may be written in the form

$$-(A - B_0R^{-1}B_0^*K_n)^*\,\delta K_n - \delta K_n\,(A - B_0R^{-1}B_0^*K_n) + \delta K_nB_0R^{-1}B_0^*\,\delta K_n - (Q_{n+1}-Q_n) = 0 \tag{5.3.16}$$

where $\delta K_n = K_{n+1} - K_n$. Equation 5.3.16 is again a Riccati
equation, with the closed-loop operator $A - B_0R^{-1}B_0^*K_n$ in place of A
and $(Q_{n+1}-Q_n)$ in place of the state-weighting operator. Since
$(Q_{n+1}-Q_n)$ is a positive operator, Theorem 4.11 implies that a
positive, bounded solution of Eq. 5.3.16 exists, from which we
may conclude that $K_{n+1} \ge K_n$. This result may be briefly summarized
by the statement that monotone approximation of the state-weighting
operator results in monotone approximation of the optimal feedback
operator. It is difficult to prove this monotonicity property by direct
modal analytic considerations, but when recourse is taken to the fact
that any modal approximation of a given order n corresponds to a
distributed optimization problem with state-weighting operator $Q_n$,
the proof becomes quite simple.

This result has a bearing on the problem of determining what
order modal approximation to choose. If n is chosen so that
$Q_n(z,\zeta)$ is a good approximation to $Q(z,\zeta)$ in the sense that

$$\int_D\int_D \left[Q(z,\zeta) - Q_n(z,\zeta)\right]x(z)x(\zeta)\,dz\,d\zeta < \epsilon\int_D\int_D Q(z,\zeta)\,x(z)x(\zeta)\,dz\,d\zeta$$

where $\epsilon$ is a small positive number, then it is clearly seen that, by
using the above procedure, the feedback kernel $K_n(z,\zeta)$ resulting
from the solution of Eq. 5.3.1, with $Q(z,\zeta) = Q_n(z,\zeta)$, will
satisfy the inequality

$$\int_D\int_D \left[K(z,\zeta) - K_n(z,\zeta)\right]x(z)x(\zeta)\,dz\,d\zeta < \epsilon\int_D\int_D K(z,\zeta)\,x(z)x(\zeta)\,dz\,d\zeta$$

where $K(z,\zeta)$ is the optimal feedback kernel. This follows directly
from solving Eq. 5.3.16 with forcing term $(Q - Q_n)$ for the difference
operator $\delta K = K - K_n$. To summarize: an analytic procedure for
determining the n which will result in an approximation to the
optimal cost of a particular degree of accuracy is to choose n such
that the state-weighting kernel is approximated to that degree of
accuracy. Let us now illustrate these ideas by means of the following
example:
Example 5.1: Consider the one-dimensional heat equation with
pointwise control, described by the equation

$$\frac{\partial x(t,z)}{\partial t} = \frac{\partial^2 x(t,z)}{\partial z^2} + B_0u(t)\,; \qquad 0 < z < 1$$

where $B_0$ is the time-invariant pointwise control operator. Here, of
course, the system operator A is $\partial^2/\partial z^2$. Let us choose the
boundary conditions to be

$$x(t,0) = x(t,1) = 0$$

In this case the system operator A is self-adjoint and the eigenvalues
are

$$\lambda_i = -i^2\pi^2$$

with associated eigenfunctions

$$v_i(z) = \sin i\pi z$$
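Let us suppose that we are using two pointwise controls, that is, k=2.
Moreover, let us choose the state weighting kernel $Q(z,\zeta)$ to be

$$Q(z,\zeta) = \begin{bmatrix} v_1(z) & v_2(z) \end{bmatrix} Q \begin{bmatrix} v_1(\zeta) \\ v_2(\zeta) \end{bmatrix}$$

where Q is a positive definite 2×2 matrix. The matrix V is

$$V = \begin{bmatrix} \sin\pi z_1 & \sin 2\pi z_1 \\ \sin\pi z_2 & \sin 2\pi z_2 \end{bmatrix}$$

The controllability matrix G is the 2×4 matrix

$$G = \left[\, V'B \;\vdots\; \Lambda V'B \,\right] = \begin{bmatrix} \beta_1\sin\pi z_1 & \beta_2\sin\pi z_2 & \lambda_1\beta_1\sin\pi z_1 & \lambda_1\beta_2\sin\pi z_2 \\ \beta_1\sin 2\pi z_1 & \beta_2\sin 2\pi z_2 & \lambda_2\beta_1\sin 2\pi z_1 & \lambda_2\beta_2\sin 2\pi z_2 \end{bmatrix}$$

The first two column vectors are linearly independent for all choices
of $z_1 \ne z_2$ in (0,1), since

$$\det\begin{bmatrix} \beta_1\sin\pi z_1 & \beta_2\sin\pi z_2 \\ \beta_1\sin 2\pi z_1 & \beta_2\sin 2\pi z_2 \end{bmatrix} = \beta_1\beta_2\left[\sin\pi z_1\sin 2\pi z_2 - \sin\pi z_2\sin 2\pi z_1\right] = 2\beta_1\beta_2\sin\pi z_1\sin\pi z_2\left[\cos\pi z_2 - \cos\pi z_1\right]$$

which is not equal to zero for $z_1 \ne z_2$, because $\sin\pi z > 0$ on
(0,1) and $\cos\pi z$ is monotonically decreasing on (0,1). Thus, the
rank of G is 2 and Eq. 5.3.9 has a positive definite solution. If
there is only one control (k=1) we have

$$G = \begin{bmatrix} \beta_1\sin\pi z_1 & \lambda_1\beta_1\sin\pi z_1 \\ \beta_1\sin 2\pi z_1 & \lambda_2\beta_1\sin 2\pi z_1 \end{bmatrix}$$

which has rank 2 since $\lambda_1$ and $\lambda_2$ are distinct, provided that
$\sin 2\pi z_1 \ne 0$, that is, $z_1 \ne \tfrac{1}{2}$. Note that in the two
control case we do not require that $\lambda_1$ and $\lambda_2$ are distinct.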
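
The rank test of Eq. 5.3.12 is easy to try out for particular control
points. The sketch below builds G for the sine eigenfunctions of this
example and computes its numerical rank by Gaussian elimination. The
function names, the tolerance, and the sample control points are
hypothetical choices made for illustration.

```python
import math

def controllability_matrix(control_points, betas, n_modes=2):
    """Build G = [V'B, Lambda V'B, ...] for v_j(z) = sin(j pi z)."""
    lam = [-(j * math.pi) ** 2 for j in range(1, n_modes + 1)]
    # Row j of V'B holds beta_i * v_j(z_i) for each control point z_i.
    vb = [[b * math.sin(j * math.pi * z)
           for z, b in zip(control_points, betas)]
          for j in range(1, n_modes + 1)]
    G = [[] for _ in range(n_modes)]
    for power in range(n_modes):
        for j in range(n_modes):
            # Lambda is diagonal, so Lambda^power scales row j.
            G[j].extend(lam[j] ** power * entry for entry in vb[j])
    return G

def rank(matrix, tol=1e-9):
    """Numerical rank by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n_rows, n_cols = len(m), len(m[0])
    r = 0
    for c in range(n_cols):
        if r == n_rows:
            break
        pivot = max(range(r, n_rows), key=lambda i: abs(m[i][c]))
        if abs(m[pivot][c]) < tol:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, n_rows):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r

print(rank(controllability_matrix([0.3, 0.7], [1.0, 1.0])))
print(rank(controllability_matrix([0.3], [1.0])))
print(rank(controllability_matrix([0.5], [1.0])))
```

A single control placed at the midpoint sits on a node of the second
eigenfunction, so in that case the second mode cannot be influenced and
the rank drops.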
In order to actually compute an optimal solution we assign the
following values:

$$B = R = Q = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

If we denote the matrix K by

$$K = \begin{bmatrix} k_{11} & k_{12} \\ k_{12} & k_{22} \end{bmatrix}$$

then the matrix equation (5.3.9) yields the three scalar equations

$$\begin{aligned} 2\pi^2k_{11} + Ak_{11}^2 + 2Bk_{11}k_{12} + Ck_{12}^2 - 1 &= 0 \\ 5\pi^2k_{12} + Ak_{11}k_{12} + Bk_{12}^2 + Bk_{11}k_{22} + Ck_{12}k_{22} &= 0 \\ 8\pi^2k_{22} + Ak_{12}^2 + 2Bk_{12}k_{22} + Ck_{22}^2 - 1 &= 0 \end{aligned} \tag{5.3.17}$$

where the scalars A, B, and C are

$$\begin{aligned} A &= \sin^2\pi z_1 + \sin^2\pi z_2 \\ B &= \sin\pi z_1\sin 2\pi z_1 + \sin\pi z_2\sin 2\pi z_2 \\ C &= \sin^2 2\pi z_1 + \sin^2 2\pi z_2 \end{aligned}$$

A simplification can be achieved if we choose the control points $z_1$
and $z_2$ to lie symmetrically about the midpoint of the interval,
$z = \tfrac{1}{2}$, that is, $z_2 = 1 - z_1$. In this case
$A = 2\sin^2\pi z_1$, $B = 0$, and $C = 2\sin^2 2\pi z_1$, so
that we obtain as a solution of the set of Eqs. 5.3.17

$$\begin{aligned} k_{11} &= \frac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin^2\pi z_1} \\ k_{12} &= 0 \\ k_{22} &= \frac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin^2 2\pi z_1} \end{aligned}$$
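
The closed-form gains for the symmetric placement can be checked by
substituting them back into the three scalar equations (5.3.17). The
following is a minimal sketch of that substitution; the control point
$z_1 = 0.3$ and the function names are hypothetical.

```python
import math

def symmetric_solution(z1):
    """Closed-form k11, k12, k22 for control points z1 and 1 - z1."""
    a = 2.0 * math.sin(math.pi * z1) ** 2
    c = 2.0 * math.sin(2.0 * math.pi * z1) ** 2
    k11 = (-math.pi**2 + math.sqrt(math.pi**4 + a)) / a
    k22 = (-4.0 * math.pi**2 + math.sqrt(16.0 * math.pi**4 + c)) / c
    return k11, 0.0, k22

def residuals(k11, k12, k22, z1, z2):
    """Left-hand sides of the three scalar equations (5.3.17)."""
    s1, s2 = math.sin(math.pi * z1), math.sin(math.pi * z2)
    d1, d2 = math.sin(2 * math.pi * z1), math.sin(2 * math.pi * z2)
    A = s1 * s1 + s2 * s2
    B = s1 * d1 + s2 * d2
    C = d1 * d1 + d2 * d2
    p2 = math.pi**2
    r1 = 2 * p2 * k11 + A * k11**2 + 2 * B * k11 * k12 + C * k12**2 - 1
    r2 = 5 * p2 * k12 + A * k11 * k12 + B * k12**2 + B * k11 * k22 + C * k12 * k22
    r3 = 8 * p2 * k22 + A * k12**2 + 2 * B * k12 * k22 + C * k22**2 - 1
    return r1, r2, r3

z1 = 0.3  # hypothetical control point; the second point is 1 - z1
k11, k12, k22 = symmetric_solution(z1)
print(residuals(k11, k12, k22, z1, 1.0 - z1))
```

The closed form divides by $\sin^2 2\pi z_1$, so this sketch should not be
called with $z_1 = \tfrac{1}{2}$.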
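Thus, the optimal cost function is

$$J(x) = \int_D\int_D x(z)\left[\frac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin^2\pi z_1}\,\sin\pi z\,\sin\pi\zeta + \frac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin^2 2\pi z_1}\,\sin 2\pi z\,\sin 2\pi\zeta\right]x(\zeta)\,d\zeta\,dz \tag{5.3.18}$$

And, from Eq. 5.3.13 with $B = R = I$, the optimal control is

$$u^*(t) = -\begin{bmatrix} k_{11}\sin\pi z_1 & k_{22}\sin 2\pi z_1 \\ k_{11}\sin\pi z_1 & -k_{22}\sin 2\pi z_1 \end{bmatrix}\begin{bmatrix} \int_D \sin\pi\zeta\; x(t,\zeta)\,d\zeta \\ \int_D \sin 2\pi\zeta\; x(t,\zeta)\,d\zeta \end{bmatrix} \tag{5.3.19}$$

where the matrix is $VK$ with $z_2 = 1 - z_1$ and its entries are

$$k_{11}\sin\pi z_1 = \frac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin\pi z_1}\,, \qquad k_{22}\sin 2\pi z_1 = \frac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin 2\pi z_1}$$

The resulting optimal closed-loop system is illustrated in Fig. 2, where
we again use the conventions adopted in Section 5.2 for Fig. 1.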


[Block diagram: the dynamical system $\partial x/\partial t = \partial^2x/\partial z^2 + u(t,z)$ with state $x(t,z)$ and input $u(t,z) = B_0u^*(t)$, the pointwise control operator $B_0$, and the two feedback channels formed from $\int_D \sin\pi\zeta\,x(t,\zeta)\,d\zeta$ and $\int_D \sin 2\pi\zeta\,x(t,\zeta)\,d\zeta$ with their gains]

Fig. 2 Closed-Loop System for Example 5.1
Note that if we had measuring devices which yield the measurement
vector

$$y(t) = \begin{bmatrix} \int_D \sin\pi\zeta\; x(t,\zeta)\,d\zeta \\ \int_D \sin 2\pi\zeta\; x(t,\zeta)\,d\zeta \end{bmatrix}$$

then $y(t)$ can be fed directly through the gain matrix

$$M = -\begin{bmatrix} \dfrac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin\pi z_1} & \dfrac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin 2\pi z_1} \\[2ex] \dfrac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin\pi z_1} & -\dfrac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin 2\pi z_1} \end{bmatrix}$$

to obtain the optimal control

$$u^*(t) = My(t)$$

Clearly, the measurement does not depend on the control point location;
only the gain matrix M does. This indicates there is a decoupling of
the measurement and control problems in the sense that changing the
control point locations does not modify the basic types of measuring
devices in use. Thus, the design procedure of "trying" different
control points in order to reduce some average cost does not interfere
with the basic structure of the closed-loop system.

This problem of minimizing some average cost with respect to
control point location can be treated analytically as a parameter
optimization problem. For example, if we consider the optimal cost
function for Example 5.1, given by Eq. 5.3.18, and take the average cost
over the unit ball in $L^2(D)$, we obtain

$$J_{avg} = \frac{1}{2}\left[\frac{-\pi^2 + \sqrt{\pi^4 + 2\sin^2\pi z_1}}{2\sin^2\pi z_1} + \frac{-4\pi^2 + \sqrt{16\pi^4 + 2\sin^2 2\pi z_1}}{2\sin^2 2\pi z_1}\right]$$

Differentiating with respect to $z_1$, equating the result to zero and
solving this equation for $z_1$ (hopefully, a solution exists in (0,1))
results in the "optimal" control point location (end of example).
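
Instead of differentiating, the averaged cost can also be scanned over a
grid of candidate locations. The sketch below does this for the
symmetric placement of Example 5.1, using the averaged cost expression as
given above. The two gains are written in rationalized form, which is
algebraically equal to the expressions in the text and remains defined
where $\sin 2\pi z_1 = 0$; the grid spacing and the function name are
hypothetical.

```python
import math

def average_cost(z1):
    """Averaged cost 0.5 * (k11 + k22) for control points z1 and 1 - z1."""
    a = 2.0 * math.sin(math.pi * z1) ** 2
    c = 2.0 * math.sin(2.0 * math.pi * z1) ** 2
    # (-p + sqrt(p^2 + a)) / a equals 1 / (p + sqrt(p^2 + a)).
    k11 = 1.0 / (math.pi**2 + math.sqrt(math.pi**4 + a))
    k22 = 1.0 / (4.0 * math.pi**2 + math.sqrt(16.0 * math.pi**4 + c))
    return 0.5 * (k11 + k22)

# Scan candidate locations strictly inside the interval (0, 1).
grid = [i / 1000 for i in range(1, 1000)]
z_best = min(grid, key=average_cost)
print(z_best, average_cost(z_best))
```

In this sketch the grid minimum falls at the midpoint, where the two
symmetric control points coincide and the second mode is no longer
influenced by the controls, so the rank condition of Theorem 5.2 is not
met there. The sketch is meant only to illustrate the search procedure,
not to settle the optimal point location question.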

Let us now consider the question, touched on briefly in the
preceding section: when can a set of measurements of the form

$$y_i(t) = \int_D m_i(z)\,x(t,z)\,dz\,; \qquad i=1,2,\ldots,n$$

where $\{m_i(z)\}_{i=1}^n$ is an, as yet, unspecified set of functions, be fed
back directly to obtain the optimal pointwise control? Let us assume
that z is a scalar and that each measurement function $m_i(z)$ may be
written as a linear combination of the elements of $\{z^{i-1}\}_{i=1}^n$, that
is, each measurement function is a polynomial of order n-1. The vector
$m(z)$, with i-th element $m_i(z)$, can then be written

$$m(z) = Wa(z) \tag{5.3.20}$$

where W is an n×n matrix and $a(z)$ is the vector

$$a(z) = \begin{bmatrix} 1 & z & z^2 & \cdots & z^{n-1} \end{bmatrix}'$$

If we choose a cost criterion of the form

$$J = \int_0^\infty \left[y'(t)y(t) + u'(t)Ru(t)\right] dt$$

then $y'(t)y(t)$ can be written in the form

$$y'(t)y(t) = \int_D\int_D Q(z,\zeta)\,x(t,z)\,x(t,\zeta)\,dz\,d\zeta$$

where

$$Q(z,\zeta) = m'(z)m(\zeta) \tag{5.3.21}$$

We shall now proceed to show that under certain conditions the optimal
feedback operator $K(z,\zeta)$ for this choice of $Q(z,\zeta)$ is of the form

$$K(z,\zeta) = m'(z)Km(\zeta) \tag{5.3.22}$$

where K is an n×n positive definite matrix. Using Eqs. 5.3.21 and
5.3.22 in the Riccati equation 5.3.1, we obtain

$$-A_z^*m'(z)Km(\zeta) - A_\zeta^*m'(z)Km(\zeta) + m'(z)KY'BR^{-1}BYKm(\zeta) - m'(z)m(\zeta) = 0 \tag{5.3.23}$$

where Y is a k×n matrix with $Y_{ij} = m_j(z_i)$. Using (5.3.20) we see
that

$$A_z^*m(z) = A_z^*Wa(z) = WA_z^*a(z)$$

Since $A_z^*$ is a differential operator, we may write

$$A_z^*a(z) = Ca(z)$$

where C is a lower triangular n×n matrix. For example, suppose
$A_z^* = \partial^2/\partial z^2$ and n=4, then

$$\frac{\partial^2}{\partial z^2}\begin{bmatrix} 1 \\ z \\ z^2 \\ z^3 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 2 \\ 6z \end{bmatrix} = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 2 & 0 & 0 & 0 \\ 0 & 6 & 0 & 0 \end{bmatrix}\begin{bmatrix} 1 \\ z \\ z^2 \\ z^3 \end{bmatrix} = Ca(z)$$

If we assume that W is nonsingular, then we may write

$$A_z^*m(z) = WCa(z) = WCW^{-1}m(z)$$

Let us denote the n×n matrix $\underline{A}$ by

$$\underline{A} = WCW^{-1}$$
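
The lower triangular matrix C can be generated for any n when the
operator is the second derivative, since differentiating $z^i$ twice gives
$i(i-1)z^{i-2}$. The following is a minimal sketch under that assumption;
the function names are hypothetical and the monomial vector is ordered
as $1, z, \ldots, z^{n-1}$.

```python
def second_derivative_matrix(n):
    """C such that d^2/dz^2 applied to [1, z, ..., z^(n-1)]' equals C times it."""
    C = [[0] * n for _ in range(n)]
    for i in range(2, n):
        # The second derivative of z^i is i (i - 1) z^(i - 2).
        C[i][i - 2] = i * (i - 1)
    return C

def monomials(z, n):
    """Vector [1, z, ..., z^(n-1)] evaluated at the point z."""
    return [z ** i for i in range(n)]

def apply(C, vec):
    """Matrix-vector product C vec."""
    return [sum(c * v for c, v in zip(row, vec)) for row in C]

C = second_derivative_matrix(4)
print(C)
print(apply(C, monomials(0.5, 4)))  # second derivatives of 1, z, z^2, z^3 at z = 0.5
```

For a different differential operator the entries change but the
lower triangular structure is kept, because differentiation never
raises the degree of a monomial.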
Then the Riccati equation (5.3.23) may be written

$$m'(z)\left[-\underline{A}'K - K\underline{A} + KY'BR^{-1}BYK - I\right]m(\zeta) = 0 \tag{5.3.24}$$

Thus, a solution $K(z,\zeta)$ of the form specified in Eq. 5.3.22 exists
if and only if a positive definite solution of the matrix equation

$$-\underline{A}'K - K\underline{A} + KY'BR^{-1}BYK - I = 0 \tag{5.3.25}$$

exists. Once again, much as in the finite modal analytic case, existence
hinges on the controllability of the finite dimensional system

$$\dot{\underline{x}} = \underline{A}\,\underline{x} + Y'Bu$$

The optimal control is now given by

$$u^*(t) = -R^{-1}BYK\int_D m(z)\,x(t,z)\,dz = -R^{-1}BYKy(t) \tag{5.3.26}$$

that is, we have the desired result of directly feeding back the output
of our measuring devices. This is a somewhat startling result in that
it is impossible, in finite dimensional systems, to have optimal output
feedback. This can be explained, however, by the fact that in finite
dimensional systems the output is of lower dimension than the state and
is written

$$y = Cx$$

where C is not a square matrix. Thus, if we tried to assume that a
solution K of the matrix Riccati equation

$$-A'K - KA + KBR^{-1}B'K - C'C = 0$$

were of the form $C'K_1C$, where $K_1$ satisfies the equation

$$-A'K_1 - K_1A + K_1CBR^{-1}B'C'K_1 - I = 0$$

with the resulting optimal control given by

$$u^*(t) = -R^{-1}B'C'K_1Cx(t) = -R^{-1}B'C'K_1y(t)$$

i.e., output feedback, we would not be able to verify that $K = C'K_1C$
is indeed a solution of the first matrix Riccati equation. This inability
to satisfy the original Riccati equation arises from the fact that

$$C'K_1CA \ne C'K_1AC$$

that is, the output matrix C and the system matrix A can never commute
when C is not a square matrix. This stumbling block is avoided in
our output feedback derivation, because of the fact that (1) we are using
differential operators ($A_z^*$ and $A_\zeta^*$) and (2) the relation
$A_z^*m(z) = \underline{A}\,m(z)$ holds, so that there is no problem in
deriving a matrix Riccati equation for K given in Eq. 5.3.22.

To illustrate this result, let us consider the following simple
example:

Example 5.2: Let us, once again, consider the scalar heat
equation and assume that we have a single pointwise control and a
single measuring device which gives the average temperature $y(t)$
over the spatial domain (0,1), or, more specifically,

$$y(t) = \int_0^1 x(t,z)\,dz$$

If we wish to minimize

$$J = \int_0^\infty \left[y^2(t) + ru^2(t)\right] dt$$

then we can place this problem within the framework of the preceding
result by observing that $a(z)$ is the scalar 1, W is the scalar 1,
and from the fact that

$$A_z^*a(z) = \frac{\partial^2}{\partial z^2}(1) = 0$$

the matrix $\underline{A}$ is the scalar 0. The matrix Y is also the scalar 1
and the matrix B is the scalar $\beta_1$. The Riccati equation (5.3.25)
thus becomes

$$\frac{\beta_1^2}{r}\,k^2 - 1 = 0$$

which has the "positive definite" solution

$$k = \frac{\sqrt{r}}{\beta_1}$$

The optimal control, from Eq. 5.3.26, is given by

$$u^*(t) = -\frac{1}{\sqrt{r}}\int_0^1 x(t,z)\,dz$$

which shows that we directly feed back the average of the temperature
distribution on (0,1).

To summarize the results of this section, we have shown that
by a judicious choice of the state-weighting kernel $Q(z,\zeta)$ one is able
to derive the finite modal approximation to the pointwise control problem,
from which it was possible to conclude the following:

1. The optimal control law over the class of control laws
which feed back only the modes under consideration is
optimal over the class of all feedback control laws.

2. The optimal cost for the n-th modal approximation
monotonically increases with n.

Neither of these two conclusions can be made very easily using
straightforward modal analytic techniques. In the example presented it
was shown that the feedback structure of the pointwise control system
can be separated into a measurement part, which is independent of control
point location, and a gain part, which depends directly on control point
location. We next considered the problem of having only n measurements
of the state distribution available, rather than the entire state
distribution, and we were able to show, under the assumption of a
particular form for these measurements, that the optimal feedback
control law consists of directly feeding back these measurements.



CHAPTER VI


CONCLUSIONS AND RECOMMENDATIONS FOR FURTHER RESEARCH 

In the preceding chapters we have considered the problem of 
minimizing a quadratic cost criterion in systems described by linear 
parabolic partial differential equations. We have shown that optimal 
controls exist both in the case where the system operator is coercive 
and in the case where the system operator is the infinitesimal gener- 
ator of a semigroup of operators. The optimal control is given by a 
bounded linear transformation of the state of the system. The resulting 
optimal feedback operator was shown to be the solution of an operator 
differential equation of the Riccati type. By application of the Schwartz 
kernel theorem the feedback operator was shown to be represented by an 
integral operator whose kernel satisfies an integro -differential equation 
of the Riccati type . 

Using these results for general parabolic optimal control prob- 
lems, we were able to specialize to the case of pointwise control. It 
was shown that the optimal pointwise control is also given by a state 
feedback law, which, in this case, is of a simpler form than that of the 
distributed control case. We were also able to use the general
results to derive the modal analytic approximation to the optimal
pointwise control and to show that for a special class of state measuring
devices the optimal pointwise control is given by a linear feedback
operation on the measured quantities.

It is felt that, in addition to the results obtained for the optimal
pointwise control problem, this research represents a philosophical
contribution to distributed parameter control theory. The general
parabolic optimal control problem was formulated in such a way as to
resemble as closely as possible an analogous problem in finite
dimensional control theory. This approach makes it possible, at many
junctures, to extend finite dimensional results directly to the
corresponding distributed parameter results using only the simplest
analytic tools.

There are several areas touched on in this thesis which remain 
open topics for research. The hyperbolic optimal control problem was 
introduced in Definition 3.7. The remainder of the thesis was devoted 
to parabolic optimal control problems, but it seems that a parallel 
development for hyperbolic problems would entail using the variation of 
constants formula (2.8.6) to eventually derive an optimal matrix feed- 
back operator which is the solution to a matrix Riccati operator equa- 
tion. Presumably, it would then be a straightforward matter to derive 
the optimal pointwise control for the hyperbolic case. 

In Example 5.1 of Section 5.3 brief mention is made of the optimal
point location problem. Although this was done there only for a special
case, one could, in the general case, take the optimal cost function
resulting from an arbitrary set of control point locations, average it
in order to eliminate dependence on the initial state, and then optimize
the averaged cost function over the set of allowable control points.
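
The procedure just described can be sketched for a modal approximation. The example below is only a sketch under stated assumptions, none of which come from the thesis: the scalar heat equation on (0, 1) with zero boundary values, eigenfunctions sqrt(2) sin(i pi z), a single control point z_c, a two-mode truncation, state weighting q I, control weighting r, zero terminal cost, and an initial state with identity covariance, so that the averaged optimal cost is the trace of the Riccati solution at t = 0. All function names are hypothetical.

```python
import math

def riccati_rhs(K, lam, b, q, r):
    """dK/dtau = A K + K A - K b b^T K / r + q I, for A = diag(lam) and symmetric K."""
    n = len(lam)
    Kb = [sum(K[i][j] * b[j] for j in range(n)) for i in range(n)]
    return [[(lam[i] + lam[j]) * K[i][j] - Kb[i] * Kb[j] / r + (q if i == j else 0.0)
             for j in range(n)] for i in range(n)]

def shifted(K, step, D):
    """Return K + step * D entrywise."""
    n = len(K)
    return [[K[i][j] + step * D[i][j] for j in range(n)] for i in range(n)]

def averaged_cost(z_c, n_modes=2, T=1.0, q=1.0, r=1.0, steps=400):
    """Trace of K(0) for pointwise control at z_c (assumed identity covariance)."""
    lam = [-(i * math.pi) ** 2 for i in range(1, n_modes + 1)]
    # Pointwise control enters each mode through the eigenfunction value at z_c.
    b = [math.sqrt(2.0) * math.sin(i * math.pi * z_c) for i in range(1, n_modes + 1)]
    K = [[0.0] * n_modes for _ in range(n_modes)]  # terminal condition K(T) = 0
    h = T / steps
    for _ in range(steps):  # RK4 in the reversed time variable tau = T - t
        d1 = riccati_rhs(K, lam, b, q, r)
        d2 = riccati_rhs(shifted(K, 0.5 * h, d1), lam, b, q, r)
        d3 = riccati_rhs(shifted(K, 0.5 * h, d2), lam, b, q, r)
        d4 = riccati_rhs(shifted(K, h, d3), lam, b, q, r)
        K = [[K[i][j] + h * (d1[i][j] + 2.0 * d2[i][j] + 2.0 * d3[i][j] + d4[i][j]) / 6.0
              for j in range(n_modes)] for i in range(n_modes)]
    return sum(K[i][i] for i in range(n_modes))

def best_location(candidates):
    """Candidate control point with the smallest averaged cost."""
    return min(candidates, key=averaged_cost)
```

A call such as `best_location([0.1 * i for i in range(1, 10)])` scans a coarse grid of allowable control points. With more modes or several control points the same averaging applies, although the search over locations then becomes a genuine multivariable optimization.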

Finally, the output feedback problem considered in Section 5.3
might be generalized to the distributed parameter analog of the finite
dimensional problem solved by Levine [32], namely, the determination of
the linear feedback operation on the output which minimizes some
averaged cost functional.



APPENDIX A 


INFINITESIMAL GENERATOR THEOREM 

If Assumptions 2.7.1, 2.7.2, and 2.7.3 hold, then the operator $A_s$
is the infinitesimal generator of a strongly continuous semigroup
$\{\Phi(t)\}_{t \in [0,\infty)}$ defined on $H_0^m(D)$. The function on $D$
represented by $\Phi(t)x$, namely $(\Phi(t)x)(z)$, is analytic in $t$ and
$m$-times differentiable in the components $z_i$ of $z$ for $t > 0$.
Moreover, if $x_0 \in H_0^m(D)$ there exists a unique function $x(t,z)$
defined for $t \ge 0$ and $z \in D$ such that

i. $x(t) \in H_0^m(D)$, $\forall\, t \in [0,\infty)$

ii. $\lim_{t \to s} \| x(t) - x(s) \|_{H_0^m(D)} = 0$, $s \in [0,\infty)$

iii. $x(0) = x_0$

iv. $x(t) \in \mathrm{Do}(A_s)$, $\forall\, t > 0$

v. $\lim_{t \to s} \| A_s x(t) - A_s x(s) \|_{H_0^m(D)} = 0$, $s \in (0,\infty)$

vi. $\dot{x}(t,z) = A x(t,z)$; $t \in (0,\infty)$, $z \in D$

vii. $x(t,z) = (\Phi(t)x_0)(z)$



APPENDIX B


VARIATION OF CONSTANTS FORMULA 

R. S. Phillips [23] proves the following result: Let $A$ be the
infinitesimal generator of a strongly continuous semigroup of
operators $\{\Phi(t)\}_{t \in [0,\infty)}$ and let $f(t)$ be strongly
continuously differentiable on $(0,\infty)$. Then for each
$x \in \mathrm{Do}(A) \subset X$ there exists a unique continuously
differentiable function $y : [0,\infty) \to H_0^m(D)$ which solves the
system

\[ \dot{y}(t) = A y(t) + f(t), \qquad y(0) = x \tag{B.1} \]

and this solution is given by

\[ y(t) = \Phi(t)x + \int_0^t \Phi(t-\sigma) f(\sigma)\, d\sigma \tag{B.2} \]

Strong continuous differentiability of $f(t) $ is required in order
that $y(t)$ be continuously differentiable. If we denote

\[ g(t) = \int_0^t \Phi(t-\sigma) f(\sigma)\, d\sigma = \int_0^t \Phi(\sigma) f(t-\sigma)\, d\sigma \]

then

\[ g(t+h) - g(t) = \int_0^t \Phi(\sigma)\,[\, f(t+h-\sigma) - f(t-\sigma)\,]\, d\sigma + \int_t^{t+h} \Phi(\sigma) f(t+h-\sigma)\, d\sigma \]

The integrand in the first term is bounded and converges pointwise to
zero as $h \to 0$. The integrand in the second term is bounded as $h \to 0$,
so that $\| g(t+h) - g(t) \| \to 0$ as $h \to 0$, implying strong continuity
of $g(t)$. Dividing $g(t+h) - g(t)$ by $h$, noting that
$\| \frac{1}{h}[\, f(t+h-\sigma) - f(t-\sigma)\,] - f'(t-\sigma) \|$
converges boundedly to zero as $h \to 0$, and noting that
$\| \Phi(\sigma) f(t+h-\sigma) - \Phi(t) f(0) \| \to 0$ as $h \to 0$ for
$\sigma \in [t, t+h]$, we can write

\[ \dot{g}(t) = \Phi(t) f(0) + \int_0^t \Phi(t-\sigma) f'(\sigma)\, d\sigma \]

and the strong continuity of $\dot{y}$ follows from the strong continuity
of $f'(t)$.

It should be noted that the application in which this result is
used in Chapter II, Section 8 requires that (B.2) hold only for
$x \in \mathrm{Do}(A_s)$ rather than $\mathrm{Do}(A)$.
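
A scalar check of formula (B.2) may help fix ideas. In the sketch below the generator is taken to be a real number a, so that the semigroup is exp(a t); this scalar setting, the forcing function used in the test, the Simpson and Runge-Kutta discretizations, and all function names are assumptions made for illustration only.

```python
import math

def semigroup(a, t):
    """Phi(t) = exp(a t), the semigroup generated by the scalar a."""
    return math.exp(a * t)

def variation_of_constants(a, x, f, t, n=200):
    """Evaluate (B.2) with composite Simpson quadrature (n must be even)."""
    if t == 0.0:
        return x
    h = t / n
    # Endpoint values of the integrand Phi(t - sigma) f(sigma).
    total = semigroup(a, t) * f(0.0) + semigroup(a, 0.0) * f(t)
    for i in range(1, n):
        sigma = i * h
        weight = 4.0 if i % 2 else 2.0
        total += weight * semigroup(a, t - sigma) * f(sigma)
    return semigroup(a, t) * x + total * h / 3.0

def integrate_ode(a, x, f, t, n=2000):
    """Integrate (B.1), dy/dt = a y + f(t), y(0) = x, directly with RK4."""
    h = t / n
    y = x
    for i in range(n):
        s = i * h
        k1 = a * y + f(s)
        k2 = a * (y + 0.5 * h * k1) + f(s + 0.5 * h)
        k3 = a * (y + 0.5 * h * k2) + f(s + 0.5 * h)
        k4 = a * (y + h * k3) + f(s + h)
        y += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return y
```

For a continuously differentiable forcing such as `math.sin`, the two functions should agree to within discretization error, which is the scalar counterpart of the statement that (B.2) solves (B.1).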



APPENDIX C 


PROOF OF THEOREM 4.2

In this appendix a proof of Theorem 4.2 due to Lions
is presented. This theorem is used in Section 4.2 to prove existence of
solutions of the parabolic system equation (3.2.2) and in Section 4.3
to derive necessary conditions for optimality in the coercive system
operator case. Let us restate the theorem for convenience.

Theorem 4.2: If $\Pi(u,v)$ is a symmetric, continuous, coercive
bilinear form on $U \times U$, and $L(u)$ is a linear form on $U$, then the
cost functional $J(u) = \Pi(u,u) - 2L(u)$ has a minimum value $J(u^*)$
if and only if $u^*$ satisfies the equation

\[ \Pi(u^*, v) = L(v), \qquad \forall\, v \in U \tag{C.1} \]

Proof: Suppose $u^*$ is the minimizing element of the space $U$. Then

\[ J(u^*) \le J((1-\theta)u^* + \theta\omega), \qquad \forall\, \omega \in U \text{ and } \theta \in [0,1] \]

or

\[ \frac{1}{\theta}\,[\, J(u^* + \theta(\omega - u^*)) - J(u^*)\,] \ge 0 \]

In the limit as $\theta \to 0$, this expression is the Frechet
derivative of $J(u)$ at $u = u^*$, which implies that

\[ \left\langle \frac{\partial J}{\partial u}\Big|_{u=u^*},\ \omega - u^* \right\rangle = 2\,[\, \Pi(u^*, \omega - u^*) - L(\omega - u^*)\,] \ge 0 \tag{C.2} \]

Since (C.2) must hold for all perturbations $\omega - u^*$, both
positive and negative, it must be true that

\[ \Pi(u^*, \omega - u^*) = L(\omega - u^*) \qquad \text{for all } \omega \in U \]

Since $\omega$ is any vector in $U$, $\omega - u^* = v$ is any vector in $U$,
so that Eq. C.1 holds.

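The "if" part of Theorem 4.2 is proved by using the
convexity of $J(u)$ to show that

\[ J(v) - J(u^*) \ge \frac{1}{\theta}\,[\, J((1-\theta)u^* + \theta v) - J(u^*)\,], \qquad \forall\, v \in U,\ \theta \in (0,1] \]

which, in the limit as $\theta \to 0$, yields

\[ J(v) - J(u^*) \ge \left\langle \frac{\partial J}{\partial u}\Big|_{u=u^*},\ v - u^* \right\rangle = 2\,[\, \Pi(u^*, v - u^*) - L(v - u^*)\,] = 0 \]

where the last equality follows from (C.1), implying

\[ J(u^*) \le J(v), \qquad \forall\, v \in U \]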
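
Theorem 4.2 can be illustrated in two dimensions, where the bilinear form is given by a symmetric positive definite matrix P and the linear form by a vector l, so that (C.1) reduces to the linear system P u* = l. The matrix representation and every name in this sketch are assumptions chosen for the example; they do not appear in the thesis.

```python
def bilinear(P, u, v):
    """Pi(u, v) = u^T P v for a symmetric positive definite matrix P."""
    n = len(u)
    return sum(u[i] * P[i][j] * v[j] for i in range(n) for j in range(n))

def linear(l, u):
    """L(u) = l^T u."""
    return sum(li * ui for li, ui in zip(l, u))

def cost(P, l, u):
    """J(u) = Pi(u, u) - 2 L(u)."""
    return bilinear(P, u, u) - 2.0 * linear(l, u)

def solve_2x2(P, l):
    """Solve P u = l by Cramer's rule; this is Eq. C.1 written in coordinates."""
    det = P[0][0] * P[1][1] - P[0][1] * P[1][0]
    return [(l[0] * P[1][1] - P[0][1] * l[1]) / det,
            (P[0][0] * l[1] - P[1][0] * l[0]) / det]
```

Since J(u* + d) - J(u*) = Pi(d, d) whenever (C.1) holds, the cost at any perturbed point cannot fall below the cost at the solution returned by `solve_2x2`, which is the finite dimensional content of the "if" part of the proof.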



APPENDIX D 


UNIFORM BOUNDEDNESS PRINCIPLE

In the proof of Theorem 4.3 and in other later theorems we appeal
to the uniform boundedness principle to obtain a uniform bound on the
operators $\Phi(t)$ and $B(t)$ on a finite interval $[0,T]$. The uniform
boundedness principle, or, as it is sometimes referred to, the
Banach-Steinhaus theorem, is as follows:

Suppose $X$ is a Banach space, $Y$ is a normed linear space, and
$\{A_\alpha\}$ is a collection of bounded linear operators of $X$ into $Y$,
where $\alpha$ ranges over some index set $A$. If it is true that

\[ \sup_{\alpha \in A} \| A_\alpha x \| < \infty \]

for every $x$ in $X$, then there exists an $M < \infty$ such that

\[ \| A_\alpha \| \le M \qquad \text{for } \alpha \in A \]

A straightforward proof of this theorem is given by Rudin. The
collections of operators $\{\Phi(t)\}_{t \in [0,T]}$ and $\{B(t)\}_{t \in [0,T]}$
are collections of bounded linear operators from one Hilbert space into
another, and the index set $A$ is $[0,T]$, so that, by this theorem, we
can write $\| \Phi(t) \| \le M$ and $\| B(t) \| \le B$.



APPENDIX E


POSITIVITY OF THE DIFFERENCE OPERATOR $\delta V_{n+1}(t)$

We show in this appendix that the solution $\delta V_{n+1}(t)$ of Eq. 4.6.19
is positive on the interval $[0,T]$. Rewriting the equation for
convenience

\[ \delta\dot{V}_{n+1}(t) = -\delta V_{n+1}(t)A - A^*\delta V_{n+1}(t) + \delta V_{n+1}(t)B(t)R^{-1}(t)B^*(t)V_n(t) + V_n(t)B(t)R^{-1}(t)B^*(t)\delta V_{n+1}(t) - N(t); \qquad \delta V_{n+1}(T) = 0 \tag{E.1} \]

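Considering the parabolic system defined on the subinterval $[s,T]$

\[ \dot{x}(t) = A x(t) - B(t)R^{-1}(t)B^*(t)V_n(t)x(t); \qquad x(s) = x_s \in \mathrm{Do}(A_s) \tag{E.2} \]

we examine the expression†

\[ \langle \delta V_{n+1}(t)x(t), x(t) \rangle \Big|_s^T = \int_s^T \frac{d}{dt} \langle \delta V_{n+1}(t)x(t), x(t) \rangle\, dt \]

\[ = \int_s^T \langle \delta\dot{V}_{n+1}(t)x(t), x(t) \rangle\, dt + \int_s^T \langle \delta V_{n+1}(t)\dot{x}(t), x(t) \rangle\, dt + \int_s^T \langle \delta V_{n+1}(t)x(t), \dot{x}(t) \rangle\, dt \]

and use Eqs. E.1 and E.2 to eliminate $\delta\dot{V}_{n+1}(t)$, $\dot{x}(t)$, and
$\delta V_{n+1}(T)$, obtaining

\[ \langle \delta V_{n+1}(s)x_s, x_s \rangle = \int_s^T \langle N(t)x(t), x(t) \rangle\, dt \ge 0 \]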

since $N(t)$ is a positive operator on $[0,T]$. This proves the positivity
of $\delta V_{n+1}(t)$ on $[0,T]$, since the initial time $s$ is arbitrary in
the interval.

† All inner products are taken in $H_0^m(D)$.
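
The identity obtained above can be checked numerically in a scalar setting. The sketch below assumes scalar coefficients with a constant closed-loop coefficient c = a - (b^2 / r) v_n, so that (E.1) reduces to d(dv)/dt = -2 c dv - n(t) with dv(T) = 0 and (E.2) has the solution x(t) = exp(c (t - s)) x_s. The scalar reduction, the weight n(t) used in the test, and all function names are assumptions made for illustration.

```python
import math

def delta_v_at(s, T, c, n_weight, steps=2000):
    """Integrate d(dv)/dt = -2*c*dv - n(t) backward from dv(T) = 0 to t = s (RK4)."""
    def slope(t, v):
        return -2.0 * c * v - n_weight(t)
    h = (T - s) / steps
    dv = 0.0
    for i in range(steps):
        t = T - i * h
        # RK4 with a negative time step -h, marching from T toward s.
        k1 = slope(t, dv)
        k2 = slope(t - 0.5 * h, dv - 0.5 * h * k1)
        k3 = slope(t - 0.5 * h, dv - 0.5 * h * k2)
        k4 = slope(t - h, dv - h * k3)
        dv -= h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return dv

def weighted_energy(s, T, c, n_weight, x_s=1.0, steps=2000):
    """Simpson quadrature of the integral of n(t) x(t)^2 over [s, T] (steps even)."""
    h = (T - s) / steps
    def integrand(t):
        x = math.exp(c * (t - s)) * x_s  # solution of the scalar version of (E.2)
        return n_weight(t) * x * x
    total = integrand(s) + integrand(T)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * integrand(s + i * h)
    return total * h / 3.0
```

For a positive weight n(t), the product `delta_v_at(s, T, c, n) * x_s**2` should match `weighted_energy(s, T, c, n, x_s)` up to discretization error and is therefore nonnegative for every initial time s in the interval.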